SYSTEMATIC POLYPEPTIDE EVOLUTION
BY REVERSE TRANSLATION 



FIELD OF THE INVENTION

We describe herein novel high-affinity 
polypeptide ligands that specifically bind a desired 
target molecule. A method is presented for selecting a 
polypeptide ligand that specifically binds any desired 
target molecule. The method is termed SPERT, an 
acronym for Systematic Polypeptide Evolution by Reverse 
Translation. The method of the invention (SPERT) is 
useful to isolate a polypeptide ligand for a desired 
target molecule. The polypeptide products of the 
invention are useful for any purpose to which a binding 
reaction may be put, for example in assay methods, 
diagnostic procedures, cell sorting, as inhibitors of 
target molecule function, as probes, as sequestering 
agents and the like. In addition, polypeptide products 
of the invention can have catalytic activity. Target 
molecules include natural and synthetic polymers, 
including proteins, polysaccharides, glycoproteins, 
hormones, receptors and cell surfaces, nucleic acids,
and small molecules such as drugs, metabolites,
cofactors, transition state analogs and toxins.

BACKGROUND OF THE INVENTION

As translation of mRNA proceeds, stable 
complexes are formed. These complexes are made of 
ribosomes bound to mRNA with tRNA and nascent 
polypeptide encoded by the messenger RNA. Termed 
"ribosome complexes" herein, such complexes can be 
isolated by various known processes (Connolly and 
Gilmore (1986) J. Cell Biol. 103:2253; Perara et al. 
(1986) Science 232:348). Antigen-encoding mRNAs have
been purified by taking advantage of the
immunoreactivity of nascent polypeptides associated
with ribosome complexes (Sambrook, J., Fritsch, [...]).
[...] immunoprecipitated from solution or separated by
protein A column chromatography from non-reactive
ribosome complexes (Schutz et al. (1977) Nuc. Acids
Res. 4:71; Shapiro and Young (1981) J. Biol. Chem.
256:1495). Cyclical selection and amplification of
[...] ribosome complexes [...].
Partitioning of ribosome complexes according to the
present invention is not restricted to immunoreactivity
of the nascent polypeptides.
SUMMARY OF THE INVENTION

In its broadest aspect, the method of
systematic polypeptide evolution by reverse translation
(SPERT) includes a candidate mixture of polypeptides
having a randomized amino acid sequence. Each member of
the mixture is linked to an individualized mRNA
which encodes the amino acid sequence of that
polypeptide. The candidate polypeptides are
partitioned according to their property of binding to a
desired target molecule. The partitioning is
carried out in such a way, herein described, that each
mRNA encoding a polypeptide is partitioned exactly
[...] further synthesis of the selected polypeptide as
desired, and further amplification of the coding
sequence. It is therefore not necessary to analyze the
amino acid sequence of the selected polypeptide (using
protein chemistry) in order to produce it in desired
quantities.

Viewed another way, the invention is the
selective evolution of a nucleic acid that encodes a
polypeptide ligand of a desired target. The present
method is therefore a selection based upon coding
properties available in a candidate nucleic acid
mixture. In previously filed applications, U.S. Serial
No. 07/536,428, filed June 11, 1990 and U.S. Serial No.
07/714,131 filed July 10, 1991, both of which are
incorporated herein by reference, the inventors herein
have taught a method for selective evolution of nucleic
acids based upon binding properties of the nucleic
acids themselves. The insight that cyclical selection
and amplification can be a powerful tool for developing
novel compounds when coupled with a partitioning system
is herein adapted to evolving specific coding nucleic
acids, based on the partitioning properties of
polypeptide ligands binding to target molecules.

More specifically, the invention includes a
method for making a polypeptide ligand of a desired
target molecule which includes the following steps:
First, synthesizing a mixture of translatable mRNAs,
having certain sequence segments in common such as a
ribosome binding site and a translation initiation
codon and having a segment encoding a polypeptide, at
least part of which coding region is a randomized
sequence. Second, employing the mRNA mixture in an in
vitro translation system. Synthesis of nascent
polypeptides ensues, each encoded by its own mRNA. At
any time during translation, stable ribosome complexes
can be isolated. It is preferred to isolate complexes
in which translation has been stopped, or "stalled", by
any of several known circumstances. Each isolated
ribosome complex [...] ribosome, one [...], as is
known [...]. The ribosome complexes are partitioned
[...] binding of each nascent polypeptide to a desired
target. [...] target pairs is then [...] the
complexes and amplified by conventional means
for amplifying nucleic acids, such as reverse
transcription and polymerase chain reaction.
This amplification sets the stage for [...]
round of transcription, polypeptide synthesis and
partitioning to further enrich for [...]
polypeptide ligands. [...] improvement in binding
[...] in vitro.
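
The first step, building a candidate mRNA mixture whose members share fixed segments and differ only in a randomized coding segment, can be sketched in a few lines of Python. This is an illustrative model only: the ribosome binding site, the spacer and the function names are assumptions introduced for the example, not sequences or procedures given in this specification.

```python
import random

# Hypothetical fixed segments, chosen only for illustration.
RIBOSOME_BINDING_SITE = "AGGAGG"  # assumed Shine-Dalgarno-like motif
SPACER = "AAUUAU"                 # assumed spacer before the start codon
START_CODON = "AUG"               # translation initiation codon


def make_candidate_mrna(n_random_codons, rng):
    """Return one mRNA: fixed 5' segments followed by a randomized coding segment."""
    # Every position is drawn uniformly from the four bases, so stop codons
    # can occur by chance inside the randomized segment.
    random_region = "".join(rng.choice("ACGU") for _ in range(3 * n_random_codons))
    return RIBOSOME_BINDING_SITE + SPACER + START_CODON + random_region


def make_candidate_mixture(size, n_random_codons, seed=0):
    """Return a list of candidate mRNAs sharing the same fixed segments."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    return [make_candidate_mrna(n_random_codons, rng) for _ in range(size)]


if __name__ == "__main__":
    for mrna in make_candidate_mixture(size=3, n_random_codons=5):
        print(mrna)
```

Every member of the mixture begins with the same fixed segments, so a single primer set and a single translation system can, in this simplified picture, serve the whole mixture.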

In an alternate embodiment of the present
invention, means for linking the nascent polypeptide to
the translated mRNA are included in the design of the
system. According to this method, a direct
connection, either via covalent bonding or very tight
affinity interactions, between the polypeptide and the
mRNA allows for the removal of the ribosomal linkage
between these two elements, leaving mRNA·polypeptide
copolymers. By removing the relatively large ribosome
from the mRNA·polypeptide copolymer, the ability to
partition polypeptides based on the affinity of the
randomized polypeptides to a given target may be
greatly increased. In addition, the ribosome is then
freed to translate additional mRNA species. The fewer
ribosomes that can be utilized, the more randomized
polypeptides can be generated in the process. In a
specific example of this embodiment, a biotin molecule
is covalently bound to the 5' end of the mRNA sequence
utilized, and the nucleic acid template includes a
fixed sequence in the translated region that encodes a
polypeptide that may be covalently bound to biotin.

The present invention provides a class of
products which are polypeptides, each having a unique
sequence, each of which has the property of binding
specifically to a desired target compound or molecule.
Each compound of the invention is a specific ligand of
a given target molecule. The invention is based on the
unique insight that cyclical selection and
amplification of nucleic acids can be applied to coding
sequences by partitioning such coding sequences
according to the binding affinities of the encoded
polypeptides. In vitro evolutionary selection can
therefore be applied for the first time to up to about
10^18 different polypeptides. Polypeptides have
sufficient capacity for forming a variety of two- and
three-dimensional structures and sufficient chemical
versatility available within their monomers to act as
ligands (form specific binding pairs) with virtually
any [...], whether monomeric or polymeric.
[...] in aqueous [...]
of salt, temperature and pH near acceptable
physiological limits. [...] other uses different binding
[...].

[...] provides a method which is [...]
mixture of candidates and step-wise iterations of
structural improvement, using the same [...]
selection theme, to achieve virtually [...]
criterion of binding affinity and selectivity.

While not bound by a theory of operation, SPERT
is based on the inventors' insight that within a
polypeptide mixture containing a large number of
possible sequences and structures there is a wide range
of binding affinities for a given target. A
polypeptide mixture comprising, [...]
randomized segment can have [...] candidate
possibilities. Those which have the higher affinity
constants for the target are most likely to bind.
After partitioning ribosome complexes or
mRNA·polypeptide copolymers, dissociation of mRNA and
reverse transcription/amplification/transcription, a
second polypeptide mixture is generated by translation,
enriched in the higher binding affinity candidates.
Additional rounds of SPERT progressively favor the best
ligands until the resulting polypeptide mixture is
predominantly composed of only one or a few sequences.
These can then be individually synthesized and tested
for binding affinity as pure ligands. One cycle of
SPERT effectively achieves reverse translation, at
least quantitatively.
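
The progressive enrichment described above can be illustrated with a deliberately idealized model. In this sketch each species is retained at partitioning with a fixed probability, and amplification is assumed to preserve the proportions of whatever was retained. The species names, starting fractions and binding probabilities are invented for illustration, and the model ignores non-specific background, target saturation and amplification bias.

```python
def spert_round(fractions, bind_prob):
    """One idealized cycle: partition by binding, then amplify proportionally."""
    # Amount of each species retained on the target at the partitioning step.
    bound = {name: fractions[name] * bind_prob[name] for name in fractions}
    total = sum(bound.values())
    # Amplification is modeled as renormalization of the retained pool.
    return {name: amount / total for name, amount in bound.items()}


def run_spert(fractions, bind_prob, rounds):
    """Return the pool composition before round 1 and after each round."""
    history = [dict(fractions)]
    for _ in range(rounds):
        fractions = spert_round(fractions, bind_prob)
        history.append(fractions)
    return history


if __name__ == "__main__":
    # Assumed starting pool: one rare tight binder among weaker candidates.
    start = {"best": 1e-6, "medium": 1e-3, "bulk": 1.0 - 1e-6 - 1e-3}
    bind_prob = {"best": 0.5, "medium": 0.05, "bulk": 0.005}
    for n, mix in enumerate(run_spert(start, bind_prob, 6)):
        print(n, f"{mix['best']:.6f}")
```

In this model the ratio of any two species changes each round by the ratio of their binding probabilities, so a species that begins as one part in a million can dominate the pool after a handful of rounds even at moderate stringency.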
The ability to rapidly select a single sequence
or family of sequences from a huge number of candidates
has been dramatically shown in the nucleic acid area.
In U.S. Patent Application Serial No. 07/714,131
(referred to herein, along with U.S. Patent Application
Serial No. 07/536,428, as the SELEX Applications),
nucleic acid ligands to a variety of targets, including
both protein targets that are known to bind nucleic
acids and protein targets that are not known to bind
nucleic acids, have been identified. In such
application there is also a description of a 
mathematical analysis of the partitioning and cycling 
aspects of SELEX referred to as SELEXION. This 
mathematical analysis dramatically demonstrated that by 
cycling through the partitioning process a number of 
times at a moderate stringency it is possible to obtain 
the individual species in a randomized mixture which 
have the highest affinity to the selected target. 

In actual practice, the SELEX Applications show
that although in some cases a single solution nucleic
acid ligand may be identified, it is more often the
case that a family of ligands is identified having
similar affinity to the target. The family of ligands
was shown to generally have the same three-dimensional
configuration and many conserved sequences.
Surprisingly, in some cases where the target was a 
nucleic acid binding protein, the SELEX process was 
able to identify a ligand solution that had a higher 
affinity to the protein than the sequence that the 
protein binds to in nature. These results emphasize 
the practicality of "short cutting" the evolutionary 
process by screening a mixture containing a very large 
number of candidates. 

Cycles of selection and amplification are
repeated until a desired goal is achieved. In the most
general case, selection/amplification is continued
until no significant improvement in binding strength is
achieved on repetition of the cycle. The iterative
selection/amplification method is sensitive enough to
allow isolation of two sequence variants in a mixture
containing at least 65,000 sequence variants. The
method could, in practice, be used to sample about 10^18
different polypeptide species. There is no upper
limit, in principle, to the number of different
polypeptides which could be sampled, only a practical
limit dictated by the sizes of reaction vessels and
other containers necessary to perform the method. The
polypeptides of the test mixture include a randomized
sequence portion as well as conserved sequences as
needed for combining with other functional domains or
to provide sufficient polypeptide length to ensure that
the randomized sequence is accessible to the target in
the ribosome complex or mRNA·polypeptide copolymer.
Amino acid sequence variants can be produced in a
number of ways including chemical or enzymic synthesis
of randomized nucleic acid coding sequences. The
variable sequence portion may contain fully or
partially random sequence; it may also contain
subportions of conserved sequence incorporated with
randomized sequence. Sequence variation in coding
nucleic acids can be introduced or increased by
mutagenesis before or during the
selection/amplification iterations.
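
The relationship between the length of a fully randomized segment and the number of distinct polypeptides can be worked out directly. The short calculation below is a sketch under two stated assumptions: each randomized position can be any of 20 amino acids, and the practical pool is taken as the figure of about 10^18 candidates mentioned in the text. The function names are hypothetical.

```python
AMINO_ACIDS = 20  # assumed alphabet size per randomized position


def polypeptide_variants(n_residues):
    """Number of distinct sequences for n fully randomized residues."""
    return AMINO_ACIDS ** n_residues


def longest_fully_sampled_segment(pool_size):
    """Largest n for which all 20**n sequences could fit in the pool."""
    n = 0
    while polypeptide_variants(n + 1) <= pool_size:
        n += 1
    return n


if __name__ == "__main__":
    pool = 10 ** 18  # assumed practical pool size, taken from the text
    n = longest_fully_sampled_segment(pool)
    print(n, polypeptide_variants(n), polypeptide_variants(n + 1))
```

Under these assumptions a pool of 10^18 molecules could in principle contain every sequence of a 13-residue randomized segment (20^13 is about 8.2 x 10^16) but not of a 14-residue segment (20^14 is about 1.6 x 10^18). Longer randomized segments are therefore sampled only sparsely.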

In the case of a polymeric target, such as a
protein, the ligand affinity can be increased by
applying SPERT to a mixture of candidates comprising a
first selected polypeptide sequence combined with a
second randomized sequence. The sequence of the first
selected ligand associated with binding, or subportions
thereof, can be introduced into the randomized portion
of the amino acid sequence of a second test mixture.
The SPERT procedure is repeated with this second test
mixture to isolate a second polypeptide ligand, having
two sequences (one being the first polypeptide ligand)
selected for binding to the target, which has increased
binding strength or increased specificity of binding
compared to the first polypeptide ligand isolated. The
sequence of the second polypeptide ligand associated
with binding to the target can then be introduced near
the variable portion of the amino acid sequence, after
which cycles of SPERT result in a third polypeptide
ligand. The third polypeptide ligand also contains the
first and second ligands previously selected. These
procedures can be repeated until a polypeptide ligand
of a desired binding strength or a desired specificity
of binding to the target molecule is achieved. The
process of iterative selection and combination of 
polypeptide sequence elements that bind to a selected 
target molecule is herein designated "walking," a term 
which implies the optimized binding to other accessible 
areas of a macromolecular target surface or cleft, 
starting from a first binding domain. Increasing the 
area of binding contact between ligand and target can 
increase the affinity constant of the binding reaction. 
These walking procedures are particularly useful for 
isolating novel polypeptides which are highly specific 
for binding to a particular target molecule. 

A variant of the walking procedure employs a 
ligand termed "anchor" which is known to bind to the 
target molecule at a first binding domain (See Figure 
8). This anchor molecule can in principle be any 
molecule that binds to the target molecule and which 
can be covalently linked directly or indirectly to a 
small bridge molecule for which a peptide binding 
sequence is known. When the target molecule is an 
enzyme, for example, the anchor molecule can be an 
inhibitor or substrate of that enzyme. The anchor can 
also be an antibody or antibody fragment specific for 
the target. The anchor molecule is covalently linked 
to the bridge molecule, chosen to bind an oligopeptide 
of known sequence. A test mixture of candidate 
polypeptides is then prepared which includes a
randomized portion and includes also the known sequence
that binds the bridging molecule. The bridging
molecule binds the polypeptides to the target molecule
in the vicinity of the anchor binding site. SPERT is
then applied to select polypeptides which bind
[...] the target molecule adjacent to the anchor
binding site. Polypeptide ligands which bind to the
target are isolated. Walking procedures as described
above can then be applied to obtain polypeptide ligands
with increased binding strength or increased
specificity of binding to the target. Walking
procedures could employ selections for binding to the
anchor binding site itself or to another part of the
target itself. This method is particularly useful to
isolate polypeptide ligands which bind at a particular
site within the target molecule. The anchor acts to
ensure the isolation of polypeptide sequences which
bind to the target molecule at or near the binding site
of the anchor.

Screens, selections or assays to assess the
effect of binding of a polypeptide ligand on the
function of the target molecule can be readily combined
with the SPERT methods. Specifically, screens for
inhibition or activation of enzyme activity can be
combined with the SPERT methods.

In more specific embodiments, the SPERT method
provides a rapid means for isolating and identifying
polypeptide ligands which bind to nucleic acids and
proteins, including enzymes, receptors, antibodies, and
glycoproteins.

In another aspect, the present invention
provides a method for detecting the presence or absence
of and/or measuring the amount of a target molecule in
a sample, which method employs a polypeptide ligand
which can be isolated by the methods described herein.
Detection of the target molecule is mediated by its
binding to a polypeptide ligand specific for that
target molecule. The polypeptide ligand can be 
labeled, for example radiolabeled or enzyme linked, to 
allow qualitative or quantitative detection, analogous 
to ELISA and RIA methods. The detection method is 
particularly useful for target molecules which are 
proteins. The method is more particularly useful for 
detecting proteins which are known to be only weakly 
antigenic, or for which conventional monoclonal 
antibodies of a desired affinity are difficult to 
produce. Thus, polypeptide ligands of the present 
invention can be employed in diagnostics in a manner 
similar to conventional antibody-based diagnostics. 
One advantage of polypeptide ligands over conventional 
antibodies in such detection methods and diagnostics is 
that polypeptides are capable of being readily 
synthesized in vitro or after cloning, since the method 
of the invention concomitantly selects the means for 
amplification, e.g., coding nucleic acids, along with 
the ligand itself. Alternatively, the polypeptide can 
be chemically synthesized since its amino acid sequence 
can be ascertained readily from the nucleotide sequence 
of its coding mRNA. A SPERT-generated polypeptide 
ligand need not be as large as an antibody molecule. 
Another advantage is that the entire SPERT process is 
carried out in vitro and does not require immunizing 
test animals. Furthermore, the binding affinity of 
polypeptide ligands can be tailored to the user's 
needs. Compared to antibodies, SPERT-generated ligands 
have much greater versatility. Conventional antibodies 
are immunoglobulins, which, although capable of a large 
repertoire of binding affinities, are nevertheless 
variations of a narrow amino acid sequence and 
structural theme. SPERT-generated polypeptide ligands, 
in contrast, are unlimited as to structural type, and 
therefore have virtually unlimited potential for 
binding. 
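
Because the coding mRNA is selected together with the ligand, the amino acid sequence of a selected polypeptide can be read off the nucleotide sequence. A minimal sketch of that deduction using the standard genetic code is shown below. The example mRNA is invented, and the function assumes translation begins at the first AUG and proceeds in frame to the first stop codon, which is a simplification.

```python
from itertools import product

# Standard genetic code, with codons enumerated in U, C, A, G order.
BASES = "UCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(mrna):
    """Deduce the encoded polypeptide (one-letter codes) from an mRNA string."""
    start = mrna.find("AUG")  # assume the first AUG is the initiation codon
    if start < 0:
        return ""
    peptide = []
    for i in range(start, len(mrna) - 2, 3):
        residue = CODON_TABLE[mrna[i:i + 3]]
        if residue == "*":  # stop codon ends the reading frame
            break
        peptide.append(residue)
    return "".join(peptide)


if __name__ == "__main__":
    # Hypothetical selected mRNA: short leader, AUG, three codons, stop.
    print(translate("GGAGGAUGGCUUGGAAAUAAGG"))
```

The deduced sequence could then be handed to chemical synthesis or used to design a cloning construct, as the text notes.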
Polypeptide ligands of small molecule targets
are useful as diagnostic assay reagents and have
[...] uses as sequestering agents,
[...] modifiers of hormone action. [...]
state analogs of an enzyme catalyzed reaction,
catalytic polypeptides can be selected. Catalytic
[...] (Pollack, S.J. et al. (1989) Meth. Enzymol.
178:551-568).

In yet another aspect, the present invention
[...] isolated by SPERT. Polypeptide ligands which bind
to a target molecule are screened to select those which
[...] modifying the function of the target. [...]
to target molecules which are proteins [...]
inhibit enzyme catalysis. In this case, an amount of
the selected polypeptide molecule which is [...]
for target protein inhibition is combined with the
target protein to achieve the desired inhibition.

The term "reverse translation" is used
throughout as shorthand for the concept of information
flow from polypeptide sequence to nucleic acid
sequence. The phrase and shorthand make reference to
the original and revised "central dogma" pronounced by
Francis Crick many years ago. Crick understood and
articulated the idea that either RNA or DNA could serve
as a template for the synthesis of complementary
nucleic acid sequences, and that chemically either RNA
or DNA could serve as a template for the synthesis of
both RNA and DNA. Crick noted that proteins, comprised
of strings of amino acids, were templated by nucleic
acid but could not serve themselves as a template for
the synthesis of nucleic acids.

Most importantly, no simple chemistry is known
that allows "reverse translation"; that was the basis
nearly 25 years ago of Crick's adaptor hypothesis for
using information in RNA to yield specified protein
sequences during translation.

SPERT has at its center a form of reverse
translation that does not conflict with Crick's
postulates. While no process, no simple chemistry, is
known that provides synthesis of a nucleic acid
containing a sequence specified by a polypeptide (whose
sequence is unknown to the scientist at the time of
reverse translation), SPERT provides a reliable
mechanism for amplifying and using mRNAs that encode
polypeptides of desired function but of unknown
sequence. Techniques for binding one or a few
polypeptides to a selected target are known in the art,
although binding of a small number of polypeptides from
a randomized pool of polypeptides is of no value by
itself. It is the concomitant selection in the
ribosome complex or mRNA·polypeptide copolymer of the
mRNAs that encode those very polypeptides that provides
a form of reverse translation because:

1) the selected coding sequences can be
amplified to yield large quantities of both DNA and
RNA;

2) the newly made mRNA can be used for
synthesizing polypeptides, now a smaller set than the
[...] removed; and

[...] polypeptides held in ribosome
complexes or mRNA·polypeptide copolymers can be used
for [...].

"Reverse translation" during SPERT
does not yield a nucleic acid from only polypeptide
[...], but "reverse translation" does provide,
[...] amplification techniques, net synthesis of the
templates from which the desired polypeptide was
[...] the partitioning and activity of the desired
[...] is an effective quantitative reverse
translation that provides the [...] for subsequent
rounds of [...] sequence [...] be used to deduce
the amino acid sequence of a selected polypeptide;
[...] polypeptide can then be synthesized by chemical
methods, if desired.

BRIEF DESCRIPTION OF THE FIGURES

Figure 1 is a diagrammatic representation of
steps in the process of the invention. The top panel
depicts a double-stranded DNA template having a T7
promoter [...] and a segment of [...]
start codon, as a vertical [...].
Transcription creates mRNAs (2nd panel) which contain,
from left to right, a ribosome binding site, a
[...] fixed sequence region, a
randomized sequence region, a 3' fixed sequence region,
and a 3' primer annealing site. In vitro translation
of this mixture gives rise to ribosome complexes with
randomized nascent polypeptides (3rd panel). The
ribosome complexes are subjected to selection for
affinity between the nascent polypeptide and a desired
target molecule (bottom panel). The encoding mRNAs of
the partitioned complexes are purified and subjected to
amplification, e.g., by reverse transcription, PCR and
transcription, to generate mRNAs for a second cycle
[...] the process.

Figure 2 is a diagram showing expanded views of
a ribosome complex. The top panel is a ribosome
complex as in the third panel of Figure 1. A cut-away
view of the ribosome (2nd panel) shows 30-40 amino
acids of the nascent polypeptide buried in the complex
and unavailable for interaction with the solvent. The
ribosome is depicted with two shades of gray to
indicate inner and outer regions. The nascent
polypeptide is depicted as a thick white line extending
vertically from a central tunnel (black) near the
center of the ribosome. That portion inside the
ribosome is depicted as 30-40 amino acids in length.
The carboxy-terminal end of the nascent polypeptide is
shown connected to a peptidyl-tRNA (curly line).
The region bordered by a dotted line is expanded in the
bottom panel showing that the nascent polypeptide is
covalently linked to a transfer RNA molecule which is
hydrogen-bonded to the mRNA at a codon in the P-site.
Figure 3 is a diagram that represents
partitioning polypeptide ligands by direct
immunoprecipitation. The top panel is a ribosome
complex as in Figure 1. The center panel depicts
several ribosome complexes where the nascent
polypeptide is represented as a short, thick white line
with hatching to indicate the segment of randomized
sequence. Molecules of a first antibody
(immunoglobulin) are represented as [...] shown [...]
are [...] first immunoglobulin, resulting in [...]
containing the selected ribosome complexes, shown
[...] in the left half of the panel.

Figure 4 is a diagram showing partitioning of
polypeptide ligands by indirect immunoprecipitation.
The top panel shows a target protein which has an
immunoreactive domain ("handle") and a target domain
("pan"). Three types of ribosome complexes are
depicted in the second panel. Those with no affinity
for the target protein are shown in white. Those with
affinity for the "pan" are shown in light gray, labeled
with a "P" and shown with a bound target protein
attached by the "pan" to the nascent peptide. Those
with affinity for the "handle" are dark gray, labeled
with an "H" and shown with a bound target protein
attached by the "handle" to the nascent peptide. In
the third panel, a first antibody (black lines)
directed against the "handle" either displaces ligand
interactions of the "H" complexes or those complexes
are unreactive. The first antisera form a sandwich
with the "P" complexes made up of a ribosome complex
associated with the target protein, through its "pan",
and bound to the first immunoglobulin through its
"handle". These "P" complexes are immunoprecipitated
by second antisera directed against the primary
antisera, as shown in the bottom panel.

Figure 5 is a diagram showing selection of
polypeptide ligands by membrane partitioning. The top
panel shows a ribosome complex as in Figure 1. The
middle panel shows ribosome complexes and membrane
vesicles with membrane proteins. The membrane vesicles 
are depicted as a hatched band interrupted by hatched 
ovals that depict membrane proteins embedded in the 
membrane. In the middle panel, ribosome complexes are 
shown binding with membrane protein so that the nascent 
polypeptides having binding affinity for a membrane 
protein are partitioned. The bottom panel depicts 
three ribosome complexes bound to a membrane vesicle, 
forming a large complex which is separable from unbound 
ribosome complexes. 

Figure 6 is a diagram showing partitioning of 
polypeptide ligands by affinity column chromatography. 
Ribosome complexes (top panel) are passed through a 
column containing insoluble support materials to which 
have been bonded target molecules. The middle panel is 
an expanded view of the column showing support 
materials (hatched circular segments) with attached 
target molecules (black bars) to which some ribosome 
complexes are bound. The bottom panel shows, enlarged, 
a single ribosome complex in which the nascent 
polypeptide (light shading) is bound to a target 
molecule which is attached to a column support bead
(hatched). Ribosome complexes with high affinity to
the target molecules are retained on the column and 
subsequently eluted to continue with SPERT. 

Figure 7 is a diagram showing anchoring of a 
binding epitope and secondary ligand evolution. A 
molecule ("inhibitor") of known affinity for a target
site on a protein is covalently linked to a "guide
epitope". The guide epitope is any molecule for which
there exists a peptide ligand, including a portion of a
monoclonal antibody which contains an epitope
recognition domain (Fab fragment). The mRNA encodes a
reactive peptide sequence that binds the guide epitope, 
incorporated into the nascent polypeptide. The bottom 
panel depicts a ribosome complex having a nascent 



WO 93/03172 



PCT/US92/00801



[...] interest [...] and by a secondary [...]
a neighboring site on the target protein of interest.
[...] portion of the nascent polypeptide is [...].

Figure 8 is a diagram which shows the DNA to be
transcribed and the relationships of the
oligonucleotides of Tables 1 and 2 in the DNA prior to
inserting the randomized sequence. The depicted
DNA constitutes a cassette for carrying out the
transcription, translation, reverse transcription and
PCR processes used in SPERT.

DETAILED DESCRIPTION OF THE INVENTION

[...] herein according [...].

Polypeptide is used herein to denote any string
of amino acid monomers capable of being [...]
produced by chemical [...].

Amino acid analogs can be employed instead of
the naturally-occurring amino acids. Any amino acid
analog that is recognized by an aminoacyl-tRNA
synthetase can be employed. Several [...] are
known, including fluorophenylalanine, norleucine,
azetidine-2-carboxylic acid, S-aminoethyl cysteine,
4-methyl tryptophan and the like.

Ligand means a polypeptide that binds another
molecule (target). In a population of candidate
polypeptides, a ligand is one which binds with greater 
affinity than that of the bulk population. In a 
candidate mixture there can exist more than one ligand 
for a given target. The ligands can differ from one 
another in their binding affinities for the target 
molecule. 

Candidate mixture is a mixture of nucleic acids 
and of polypeptides of differing sequence, from which 
to select a desired coding sequence and/or a desired 
ligand. The candidate mixture of nucleic acids serving 
as source of a candidate mixture of polypeptides can be 
in vitro transcription products of naturally-occurring 
nucleic acids or fragments thereof, chemically 
synthesized nucleic acids, enzymatically synthesized 
nucleic acids or nucleic acids made by a combination of 
the foregoing techniques.

Target molecule means any
compound of interest for which a ligand is desired. A
target molecule can be a protein, fusion protein, 
peptide, enzyme, nucleic acid, nucleic acid binding 
protein, carbohydrate, polysaccharide, glycoprotein, 
hormone, receptor, receptor ligand, cell membrane 
component, antigen, antibody, virus, virus component, 
substrate, metabolite, transition state analog, 
cofactor, inhibitor, drug, controlled substance, dye, 
nutrient, growth factor, toxin, lipid, glycolipid, 
etc., without limitation. 

Partitioning means any process whereby ribosome
complexes or mRNA·polypeptide copolymers bound to
target molecules, termed complex-target pairs herein,
can be separated from ribosome complexes or
mRNA·polypeptide copolymers not bound to target
molecules. Partitioning can be accomplished by various
methods known in the art. The only requirement is a
means to separate complex-target pairs from unbound
ribosome complexes or mRNA·polypeptide copolymers.
Columns which selectively bind complex-target pairs but
not ribosome complexes or mRNA·polypeptide copolymers
(or specifically retain ligand to an immobilized
target) can be used for partitioning. A membrane or
membrane fragment having the target on its surface can
bind ligand-bearing ribosome complexes or
mRNA·polypeptide copolymers, forming the basis of a
partitioning based on particle size. The choice of
partitioning method will depend on properties of the
target and of the complex-target pairs and can be made
according to principles and properties known to those
of ordinary skill in the art.

Amplifying means any process or combination of 
process steps that increases the amount or number of 
copies of a molecule or class of molecules. Amplifying 
coding mRNA molecules in the disclosed examples is 
carried out by a sequence of three reactions: making 
cDNA copies of selected mRNAs, using polymerase chain 
reaction to increase the copy number of each cDNA, and 
transcribing the cDNA copies to obtain an abundance of 
mRNA molecules having the same sequences as the 
selected mRNAs. Any reaction or combination of 
reactions known in the art can be used as appropriate, 
including direct DNA replication, direct mRNA 
amplification and the like, as will be recognized by 
those skilled in the art. The amplification method 
should result in the proportions of the amplified 
mixture being essentially representative of the 
proportions of different sequences in the mixture prior 
to amplification. 

Specific binding is a term which is defined on 
a case-by-case basis. In the context of a given 
interaction between a given ligand and a given target, 
a binding interaction of ligand and target of higher 
affinity than that measured between the target and the 
candidate ligand mixture is observed. In order to 
compare binding affinities, the conditions of both 
binding reactions must be the same, and should be 
comparable to the conditions of the intended use. For 
the most accurate comparisons, measurements will be 
made that reflect the interaction between ligand as a 
whole and target as a whole. The polypeptide ligands 
of the invention can be selected to be as specific as 
required, either by establishing selection conditions 
that demand the requisite specificity during SPERT, or 
by tailoring and modifying the ligands through 
"walking" and other modifications using iterations of 
SPERT. 

Randomized is a term used to describe a segment 
of a nucleic acid or polypeptide having, in principle, 
any possible sequence over a given length. Randomized 
nucleic acid sequences will be of various lengths, as 
desired, ranging from about twelve to more than 300 
nucleotides. The chemical or enzymatic reactions by 
which random sequence segments are made may not yield 
mathematically random sequences due to unknown biases 
or nucleotide preferences that may exist. Redundancy 
of the genetic code, and biases in the tRNA content of 
an in vitro translation system can introduce additional 
bias in the translated amino acid sequences. 
Introducing a deliberate bias into a randomized coding 
region can reduce the bias of the resulting translated 
amino acid sequence. The term "randomized" is used 
instead of "random" to reflect the possibility of such 
deviations from ideality. In the techniques 
presently known, for example sequential chemical 
synthesis, large deviations are not known to occur. 

A bias may be deliberately introduced into a 
randomized sequence, for example, by altering the molar 
ratios of precursor nucleoside (or deoxynucleoside) 
triphosphates of the synthesis reaction. A deliberate 
bias may be desired, for example, to improve the 
randomness of amino acid sequence of translated 
polypeptides or to lower the frequency of appearance of 
certain amino acids. 
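
The effect of a deliberate bias can be estimated before 
any synthesis is attempted. The following Python sketch 
is illustrative only and is not part of the disclosed 
method. It assumes the standard genetic code, independent 
incorporation at each codon position, and that the molar 
ratios supplied to the synthesis translate directly into 
incorporation frequencies (an idealization). The function 
name and the example ratios are hypothetical. 

```python
from collections import defaultdict
from itertools import product

BASES = "UCAG"
# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def residue_frequencies(pos1, pos2, pos3):
    """Expected frequency of each residue ('*' = stop) for one codon.

    Each argument maps a base to its assumed incorporation probability
    at that codon position; each mapping should sum to 1.
    """
    freqs = defaultdict(float)
    for codon, aa in CODON_TABLE.items():
        p = (pos1.get(codon[0], 0.0)
             * pos2.get(codon[1], 0.0)
             * pos3.get(codon[2], 0.0))
        freqs[aa] += p
    return dict(freqs)


equimolar = {b: 0.25 for b in BASES}
unbiased = residue_frequencies(equimolar, equimolar, equimolar)

# Hypothetical bias: omit A from the third codon position.
no_a_third = {"U": 1 / 3, "C": 1 / 3, "G": 1 / 3}
biased = residue_frequencies(equimolar, equimolar, no_a_third)
```

Under these assumptions an equimolar mixture gives three 
stop codons in 64, and leucine (six codons) is six times 
as frequent as methionine (one codon). Omitting A from the 
third position in this hypothetical scheme leaves one stop 
codon (UAG) among 48 possible codons. 
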
For example, in a randomized sequence biased for 
codons of the form ARN (where A is Adenine, R is 
Adenine or Guanine and N is any nucleotide) the most 
commonly encoded amino acids are [...] (Lys, Arg, 
Asn and Ser). Randomized sequences biased for codons 
of the form GAN are biased for acidic amino acids, 
aspartic acid (GAU, GAC) and glutamic acid (GAA, 
GAG). [...] By such 
strategies, randomized coding sequences can be 
biased for the type of structure likely to bind a given 
target. For example, polypeptide sequences biased for 
acidic amino acids can bind cationic target molecules 
more readily than completely random polypeptides. 
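
Which residues a biased codon pattern can encode follows 
directly from the standard genetic code. The Python sketch 
below enumerates the codons matching a three-letter 
degenerate pattern and counts the residues they encode. It 
is an illustration, not part of the disclosed method: the 
function name is hypothetical, only the degenerate symbols 
R and N are supported, and the pattern GAN is used as an 
assumed example of a bias toward acidic residues. 

```python
from collections import Counter
from itertools import product

BASES = "UCAG"
# Standard genetic code, codons ordered UUU, UUC, UUA, UUG, UCU, ...
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}

# Subset of the IUPAC nucleotide codes: R = purine, N = any base.
IUPAC = {"A": "A", "C": "C", "G": "G", "U": "U", "R": "AG", "N": "UCAG"}


def residues_for_pattern(pattern):
    """Count residues encoded by all codons matching a 3-symbol pattern."""
    choices = [IUPAC[symbol] for symbol in pattern]
    return Counter(CODON_TABLE["".join(c)] for c in product(*choices))


arn = residues_for_pattern("ARN")  # Asn, Lys, Ser, Arg: two codons each
gan = residues_for_pattern("GAN")  # Asp, Glu: two codons each
```
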

Translatable mRNA is mRNA which possesses all 
requisite sequences for translation in a conventional 
in vitro translation system. These include proper 
[...] and sequence proximal to the 5' end of the 
mRNA, a ribosome binding site and an initiation codon. 

Ribosome binding site means a nucleotide 
sequence in the mRNA which functions as a binding site 
for a ribosome in an in vitro translation system. 
Sequences which function as ribosome binding sites 
differ depending on whether the ribosomes are of 
prokaryotic or eukaryotic origin. [...] 
is therefore usually located within 5 - 12 bases from 
the ribosome binding site in the 3' direction on the 
mRNA. These sequences are sometimes termed a 
Shine-Dalgarno sequence. The structures of ribosome 
binding sites and their proper placement to ensure 
correct initiation of protein synthesis are well known 
in the art. 

WO 93/03172    PCT/US92/00801 

Initiation codon is a characteristic 
trinucleotide sequence, AUG, which encodes methionine 
as the first amino acid of an encoded 
polypeptide and also sets the codon reading frame for 
the nucleotide sequence in the 3' direction from the 
initiation codon. 

Ribosome complex is a macromolecular complex 
including at least one ribosome, attached mRNA molecule 
and for each ribosome, a nascent polypeptide attached 
via tRNA to the ribosome. The nascent polypeptide has 
an amino acid sequence encoded by the attached mRNA. 
Ribosome complexes are formed, as is known in the art, 
during protein synthesis. Ribosome complexes are 
stable if they become stalled for any reason, for 
example, by depletion of release factor, lack of 
termination codon in the message, lack of a charged 
tRNA, etc., as known in the art. The mRNA together 
with attached ribosome(s) and nascent peptide(s) remain 
stably bound and can be isolated together, using 
methods known in the art. 

mRNA·polypeptide copolymer is a macromolecular 
complex including an mRNA and a polypeptide having an 
amino acid sequence encoded by the attached mRNA. 
According to one embodiment of the invention, 
mRNA·polypeptide copolymers are formed by the creation 
of a candidate mixture in which the RNA includes fixed 
sequences and/or chemical modifications in both 
non-translated and translated regions so that a portion 
of the translated polypeptide will link with a portion 
of the mRNA via a covalent bond or tight affinity 
interaction. In other embodiments, the translated 
polypeptides or tRNA species utilized may be modified 
as well to facilitate the formation of mRNA·polypeptide 
or mRNA·tRNA·polypeptide copolymers. 

In vitro translation can be carried out using 
[...]. These well-known translation systems 
[...] rabbit reticulocyte system. The latter is 
available commercially. The conditions for in vitro 
translations are well-known in the art, and 
modifications, adaptations and optimizations are 
[...]. 

[...] polypeptide and an in vitro translation system 
constitute amplifying means for amplifying the quantity 
of polypeptide encoded by the mRNA. The mRNA can itself 
be amplified using reverse transcription, PCR with 
appropriate primers and an RNA polymerase. The 
amplified mRNA can serve for in vitro synthesis of 
desired quantities of the encoded polypeptide. As 
noted, supra, this process constitutes reverse 
translation. 

[...] peptide [...] 
conventional meanings known in the art. The term 
"translated mRNA" simply refers to mRNA present in a 
ribosome complex, either wholly or partially 
translated. 

Ribosome complex-target pairs are ribosome 
complexes of which the nascent polypeptide component is 
bound to a target molecule. The target may be 
free in solution or bound to a solid support matrix. 

Homology is used to compare the relatedness of 
sequences. Percent amino acid sequence homology is 
measured by comparing sequences of equal length 
position by position. The percent of those positions 
occupied by the same amino acid in two polypeptides is 
the percent sequence homology. Thus, given peptide 
ABCDE as a naturally-occurring comparison peptide, 
peptides ABCDX or ABXDE are 80% homologous but peptides 
ABXYZ, AXYZE and XYZDE are 40% homologous and peptides 
EDCBA, BDAEC, MNOPQ are non-homologous. 
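
The position-by-position measure defined above can be 
written as a short Python function. This is a minimal 
sketch; the function name is hypothetical, and the 
single-letter symbols are the placeholder residues used 
in the example, not real amino acid codes. 

```python
def percent_homology(reference, candidate):
    """Percent of aligned positions occupied by the same residue.

    Sequences must be of equal length, as the definition requires;
    no gaps or alignment shifts are considered.
    """
    if len(reference) != len(candidate):
        raise ValueError("sequences must be of equal length")
    if not reference:
        raise ValueError("sequences must not be empty")
    matches = sum(a == b for a, b in zip(reference, candidate))
    return 100.0 * matches / len(reference)


for peptide in ("ABCDX", "ABXDE", "ABXYZ", "AXYZE", "XYZDE", "MNOPQ"):
    print(peptide, percent_homology("ABCDE", peptide))
```
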

The SPERT method involves the combination of a 
selection of polypeptide ligands which bind to a target 
molecule, for example a protein, with amplification of 
those selected polypeptides via the attached mRNAs. 
Iterative cycling of the selection/amplification steps 
allows selection of one or a small number of 
polypeptides which bind most strongly to the target 
from a pool which contains a very large number of 
nucleic acids and hence encoded polypeptides. 

Cycling of the selection/amplification 
procedure is continued until a selected goal is 
achieved. For example, cycling can be continued until 
a desired level of binding of the polypeptides in the 
test mixture is achieved or until a minimum number of 
polypeptide components of the mixture is obtained (in 
the ultimate case until a single species remains in the 
test mixture). In many cases, it will be desired to 
continue cycling until no further improvement of 
binding is achieved. It may be the case that certain 
test mixtures of polypeptides show limited improvement 
in binding over background levels during cycling of the 
selection/amplification. In such cases, the sequence 
and length variation in the test mixture should be 
increased until improvements in binding are achieved. 
Anchoring protocols and/or walking techniques can be 
employed as well. 
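
The cycling logic can be illustrated with a toy model. 
The Python sketch below is not a description of the 
disclosed procedure: it assumes that each species is 
retained on the target with a fixed probability in every 
round, that amplification is faithful and proportional, 
and that no mutation occurs. All names and numbers are 
hypothetical. Cycling stops when the mean retention of 
the pool no longer improves by more than a tolerance. 

```python
def spert_cycle(freqs, retention):
    """One selection/amplification round on a pool of species.

    freqs[i]     : fraction of the pool made up of species i
    retention[i] : assumed probability that species i stays on the target
    Returns the renormalized pool after proportional amplification.
    """
    selected = [f * r for f, r in zip(freqs, retention)]
    total = sum(selected)
    return [s / total for s in selected]


def mean_retention(freqs, retention):
    """Average retention probability of the current pool."""
    return sum(f * r for f, r in zip(freqs, retention))


def run_until_plateau(freqs, retention, tolerance=1e-6, max_rounds=100):
    """Cycle until mean retention stops improving; returns (pool, history)."""
    history = [mean_retention(freqs, retention)]
    for _ in range(max_rounds):
        freqs = spert_cycle(freqs, retention)
        history.append(mean_retention(freqs, retention))
        if history[-1] - history[-2] < tolerance:
            break
    return freqs, history


# Hypothetical pool: one strong binder hidden among 999 weak ones.
retention = [0.5] + [0.001] * 999
start = [1 / 1000] * 1000
pool, history = run_until_plateau(start, retention)
```

In this idealized model the strong binder dominates the 
pool within a few rounds. Real enrichment depends on 
background binding and on amplification fidelity. 
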

Specifically, the method requires the initial 
preparation of a test mixture of candidate 
polypeptides. A translatable mRNA mixture is prepared, 
each member of the mixture including in its nucleotide 
sequence a ribosome binding site, an initiation codon 
and a randomized coding region. Preferably the 
individual mRNA's contain a randomized region flanked 
by sequences conserved in all nucleic acids in the 
mixture. The conserved regions are provided to 
facilitate amplification of selected nucleic acids. 
Since there are many such sequences known in the art, 
the choice of sequence is one which those of ordinary 
skill in the art can make, having in mind the desired 
method of amplification. The randomized coding region 
can have a fully or partially randomized sequence 
according to the desired translation product. 

Alternatively, the coding region can contain 
subportions that are random, along with subportions 
which are held constant in all nucleic acid species in 
the mixture. For example, sequence regions known to 
encode amino acid sequences that bind, or have been 
selected for binding, to the target can be integrated 
with randomized coding regions to achieve improved 
binding or improved specificity of binding. 
Variability in the polypeptide test mixture can also be 
introduced or augmented by generating mutations in 
coding mRNA's during the selection/amplification 
process. In principle, the mRNA's employed in the test 
mixture can be any length as long as they can be 
[...]. The method of the present invention is most 
practically employed for selection from a large number 
of sequence variants. Thus, it is contemplated that 
the present method will preferably be employed to 
assess binding of polypeptide sequences ranging in 
length from about four amino acids to any attainable 
size. 

The randomized portion of the coding nucleic 
acids in the test mixture can be derived in a number 
of ways. For example, full or partial sequence 
randomization can be readily achieved by direct 
chemical synthesis of the nucleic acid (or portions 
thereof) or by synthesis of a template from which the 
nucleic acid (or portions thereof) can be prepared by 
use of appropriate enzymes. Chemical synthesis 
provides the advantages of being precisely controllable 
as to length and allowing individual randomization at 
each triplet position. A commercial DNA synthesizer 
can be used, either with an equivalent mixture of the 
four activated nucleotide substrates or with a biased 
mixture. Alternatively, the synthesizer can be set up 
to provide a limited nucleotide selection at a given 
position, e.g., only A at the first triplet position. 
End addition, catalyzed by terminal transferase in the 
presence of nonlimiting concentrations of all four 
nucleotide triphosphates can add a randomized sequence 
to a segment. Sequence variability in the coding 
nucleic acids can also be achieved by employing size- 
selected fragments of partially digested (or otherwise 
cleaved) preparations of large, natural nucleic acids, 
such as genomic DNA preparations or cellular RNA 
preparations. In those cases in which randomized 
sequence is employed, it is not necessary (or possible 
from long randomized segments) that the test mixture 
contains all possible variant sequences. It will 
generally be preferred that the test mixture contain as 
large a number of possible sequence variants as is 
practical for selection, to insure that a maximum 
number of potential amino acid sequences of the 
translated polypeptide are identified. A randomized 
sequence of 60 nucleotides will contain a calculated 
10^36 different candidate nucleic acid sequences which 
would encode 10^26 possible 20-residue polypeptides. As a 
practical matter, it is possible to sample only about 
10^18 polypeptide candidates in a single selection. 
Therefore, candidate mRNA mixtures that have randomized 
segments longer than 60 contain too many possible 
sequences for all to be sampled in one selection. Many 
epitopes recognized by antibodies are only 5-10 amino 
acids in length. It is not necessary to sample all 
possible sequences of a candidate mixture to select a 
polypeptide ligand of the invention. It is basic to 
the method that the coding nucleic acids of the test 
mixture are capable of being amplified. Thus, it is 
preferred that any conserved regions employed in the 
test nucleic acids do not contain sequences which 
interfere with amplification. 
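
The library-size arithmetic above can be checked directly. 
The Python sketch below is illustrative only. It treats 
every position as freely variable, counts 20 amino acids 
per codon, and ignores stop codons and codon redundancy. 
A 60-nucleotide segment corresponds to 20 codons. The 
function names are hypothetical, and the sampling limit 
of 10^18 is the figure quoted in the text. 

```python
def sequence_space(alphabet_size, length):
    """Number of distinct sequences of the given length."""
    return alphabet_size ** length


def longest_fully_sampled(alphabet_size, sample_limit):
    """Longest randomized length whose whole space fits in sample_limit."""
    length = 0
    while sequence_space(alphabet_size, length + 1) <= sample_limit:
        length += 1
    return length


nucleic_60 = sequence_space(4, 60)    # about 1.3e36 nucleic acid sequences
peptide_20 = sequence_space(20, 20)   # 20 codons, about 1.0e26 polypeptides
sample_limit = 10 ** 18               # practical limit quoted in the text

max_nt = longest_fully_sampled(4, sample_limit)    # in nucleotides
max_aa = longest_fully_sampled(20, sample_limit)   # in amino acid residues
```

Under these assumptions a pool of 10^18 candidates can 
exhaust the sequence space only for randomized segments 
of up to 29 nucleotides, or up to 13 freely varied amino 
acid positions. 
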

The practical considerations that limit the 
number of candidates that may be sampled include the 
volume or mass of materials that can be handled in a 
laboratory environment. A system that operates to form 
ribosome complexes requires a stoichiometric amount of 
ribosomes in the translation mixture. The presence of 
this quantity of ribosomes severely limits the number 
of sequences that can be sampled, to about 10^[...] to 
10^14 complexes. The production and isolation of 
quantities of ribosomes in excess of these counts 
would be impractical. As E. coli has only about 10^[...] 
ribosomes per cell, a huge amount of E. coli would be 
required to produce stoichiometric amounts of 
ribosomes. The limitation of 10^[...] to 10^14 complexes is 
higher than the limitations found in other systems that 
have been devised for sampling large numbers of 
randomized polypeptides. However, when the ribosome is 
not bound up in the ribosome complex but is free to 
translate a large number of mRNA species in the 
reaction mixture, the number of mRNA species that can 
be practically tested at a time rises to at least about 
10^17 to 10^18 different candidate sequences, depending on 
the number of mRNAs translated by a single ribosome. 

The complex of a ribosome, mRNA, and nascent 
polypeptide attached to a tRNA in the P-site of the 
ribosome is very stable. Release of the nascent 
peptide from the complex and of the mRNA from the 
ribosome requires protein release factors. Release 
factor recognition requires the positioning of the stop 
codons of the mRNA in the A-site of the ribosome. In 
the absence of a stop codon or release factor the 
dissociation of the translation complex from mRNA is 
very slow. The addition of the antibiotics 
cycloheximide (eukaryotic systems) and chloramphenicol 
(prokaryotic systems) further stabilizes the complexes 
so that extensive manipulations like column 
chromatography and gradient centrifugation can be 
performed. 

In this embodiment a ribosome is preferably 
paused at the end of a coding sequence on a mRNA with 
the encoded nascent polypeptide available for 
partitioning of the complex. There are a number of 
ways in which this can be accomplished. Because stop 
codons are essential for release factor action, a 
translating ribosome that does not encounter any stop 
codons will proceed to the end of a mRNA and stall at 
the 3' end (Connolly and Gilmore, supra). In vitro 
translation systems which have been depleted of release 
factor (by immunoinactivation or mutation) will result 
in the stalling of translation complexes at stop 
codons. Removal of GTP, the use of non-hydrolyzable 
analogues, and the use of certain antibiotics will also 
stall translational complexes. The timed addition of 
these exogenous factors to a synchronous in vitro 
translation reaction can produce predictable sizes of 
nascent polypeptide for the successful partitioning of 
the translational complex. In some organisms there 
exist temperature-sensitive tRNA synthetase mutants. 
Another way of stalling translational complexes at 
defined sites is to include at the 3' end of the coding 
region a stretch of sense codons which are recognized 
by a single species of tRNA for which there exists a 
conditional tRNA synthetase mutant. In vitro 
translation reactions done from extracts of such 
mutants under the restrictive condition will result in 
stalled complexes at the stretch of sense codons for 
that particular tRNA. 

It will be understood that it is not necessary 
to stall or pause the translation process to obtain 
partitionable ribosome complexes. Stable complexes can 
be isolated at any time during active translation. It 
is advantageous to isolate actively translating 
ribosome complexes when it is desired to vary the 
length of the randomized segment, e.g., to test the 
effects of polypeptide length on binding efficacy. 
Ribosome complexes isolated during active translation 
constitute a population of nascent peptides of varied 
length. By synchronously initiating translation and 
isolating ribosome complexes at various times 
thereafter, the effects of increasing polypeptide 
length can be compared. 

Polymerase chain reaction (PCR) is an exemplary 
method for amplifying nucleic acids. Descriptions of 
PCR methods are found, for example, in Saiki et al. 
(1985) Science 230:1350-1354; Saiki et al. (1986) 
Nature 324:163-166; Scharf et al. (1986) Science 
233:1076-1078; Innis et al. (1988) Proc. Natl. Acad. 
Sci. USA 85:9436-9440; and in U.S. Patent 4,683,195 
(Mullis et al.) and U.S. Patent 4,683,202 (Mullis et 
al.). In its basic form, PCR amplification involves 
repeated cycles of replication of a desired 
single-stranded DNA (or cDNA copy of an RNA) employing 
specific oligonucleotide primers complementary to the 
3' ends of both strands, primer extension with a DNA 
polymerase, and DNA denaturation. Products generated by 
extension from one primer serve as templates for 
extension from the other primer. A related 
amplification method described in PCT published 
application WO 89/01050 (Burg et al.) requires the 
presence or introduction of a promoter sequence 
upstream of the sequence to be amplified, to give a 
double-stranded intermediate. Multiple RNA copies of 
the double-stranded promoter-containing intermediate 
are then produced using RNA polymerase. The resultant 
RNA copies are treated with reverse transcriptase to 
produce additional double-stranded promoter-containing 
intermediates which can then be subject to another 
round of amplification with RNA polymerase. 
Alternative methods of amplification 
include among others cloning of selected DNAs or cDNA 
copies of selected RNAs into an appropriate vector and 
introduction of that vector into a host organism where 
the vector and the cloned DNAs are replicated and thus 
amplified (Guatelli, J.C. et al. (1990) Proc. Natl. 
Acad. Sci. 87:1874). In general, any means that will 
allow faithful, efficient amplification of selected 
nucleic acid sequences can be employed in the method of 
the present invention. It is only necessary that the 
proportionate representations of sequences after 
amplification reflect the relative proportions of 
sequences in the mixture before amplification. 
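
The requirement that amplification preserve proportions 
can be stated numerically. The Python sketch below is a 
simplified model, not the disclosed protocol: it assumes 
that every sequence is copied with the same per-cycle 
efficiency and it ignores reagent depletion. The sequence 
labels, copy numbers and efficiency are hypothetical. 

```python
def amplify(copies, cycles, efficiency=1.0):
    """Expected copy numbers after PCR with one shared per-cycle efficiency.

    efficiency = 1.0 means perfect doubling in every cycle.
    """
    factor = (1.0 + efficiency) ** cycles
    return {seq: n * factor for seq, n in copies.items()}


def proportions(copies):
    """Fraction of the pool contributed by each sequence."""
    total = sum(copies.values())
    return {seq: n / total for seq, n in copies.items()}


selected = {"seqA": 900, "seqB": 90, "seqC": 10}   # hypothetical pool
after = amplify(selected, cycles=20, efficiency=0.9)

before_fractions = proportions(selected)
after_fractions = proportions(after)   # unchanged under this model
```

If individual sequences were copied with different 
efficiencies, the fractions would drift from cycle to 
cycle, which is the situation the requirement above is 
meant to exclude. 
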

Specific embodiments of the present invention 
for amplifying RNAs are based on Innis et al. (1988) 
supra. The RNA molecules in the test mixture are 
designed to contain a sequence transcribed from a T7 
promoter in their 5' portions. Full-length cDNA copies 
of selected mRNA molecules are made using reverse 
transcriptase primed with an oligomer complementary to 
the 3' sequences of the selected RNAs. The resultant 
cDNAs are amplified by Taq DNA polymerase chain 
extension, employing a primer containing the T7 
promoter sequence as well as a sequence complementary 
to the conserved 5' end of the selected RNAs. Double-
stranded products of this amplification process are 
then transcribed in vitro. Transcripts are used in the 
next selection/amplification cycle. The method can 
optionally include appropriate nucleic acid 
purification steps. 
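
The primer arrangement described above can be sketched 
in Python. This is an illustration under stated 
assumptions, not the primers actually used: the two 
conserved flanking sequences are invented placeholders, 
the function names are hypothetical, and the 18-base 
string is the commonly cited T7 promoter core, whose 
final G is taken as the first transcribed base. The 
5' conserved region is assumed to be the transcript 
sequence that follows that initiating G. 

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")

# Commonly cited T7 promoter core; the final G is taken as position +1.
T7_PROMOTER = "TAATACGACTCACTATAG"


def reverse_complement(dna):
    """Reverse complement of a DNA sequence written 5'->3'."""
    return dna.translate(COMPLEMENT)[::-1]


def design_primers(fixed_5prime, fixed_3prime):
    """Return (rt_primer, t7_primer) for templates with the given flanks.

    Both flanks are given as DNA, 5'->3', in the sense (mRNA-like)
    orientation.
    """
    # Anneals to the 3' conserved region of the RNA for cDNA synthesis.
    rt_primer = reverse_complement(fixed_3prime)
    # Anneals to the cDNA and restores the promoter upstream of the template.
    t7_primer = T7_PROMOTER + fixed_5prime
    return rt_primer, t7_primer


# Hypothetical conserved flanking sequences, for illustration only.
FIXED_5 = "GGAGACCACAACGGTTTCCC"
FIXED_3 = "AGCTTGGATCCGTCGC"
rt_primer, t7_primer = design_primers(FIXED_5, FIXED_3)
```
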

In general, any protocol which will allow 
selection of polypeptides based on their ability to 
bind specifically to another molecule, i.e., a protein 
or any target molecule, can be employed in the method 
of the present invention. It is only necessary that 
the ribosome complexes or mRNA·polypeptide copolymers 
be partitioned without disruption such that the 
selected coding mRNA's are capable of being amplified. 
For example, a column binding selection in which a 
test mixture of ribosome complexes bearing nascent 
polypeptides [...] 
a column partitioning system. Target-binding 
polypeptides together with mRNAs encoding each remain 
bound to the column. The relative concentration of 
target protein to test polypeptides in the incubated 
mixture influences the strength of binding that is 
selected for. When polypeptide is in excess, 
competition for available binding sites occurs and 
those polypeptides which bind most strongly are 
selected. Conversely, when an excess of target is 
employed, it is expected that any polypeptide that 
binds to the target will be selected. The relative 
concentrations of target to polypeptide employed to 
achieve the desired selection will depend on the type 
of target, the strength of the binding interaction and 
the level of any background binding that is present. 
The relative concentrations needed to achieve the 
desired partitioning result can be readily determined 
empirically without undue experimentation. Similarly, 
it may be necessary to optimize the column elution 
procedure to minimize background binding; again such 
optimization of the elution procedures is within the 
skill of the ordinary artisan. 

An unexpected feature of the invention is the 
fact that the polypeptide ligand need not be elutable 
from the target to be selectable. This is because it 
is the mRNA that is recovered for further amplification 
or cloning, not the polypeptide itself. It is known 
that some affinity columns can bind the most avid 
ligands so tightly as to be very difficult to elute. 
However, the method of the invention can be successfully 
practiced to yield avid ligands, even covalent binding 
ligands. Ribosome complexes can be disrupted by 
denaturing agents such as urea or sodium dodecyl 
sulfate without affecting the integrity of the mRNA. 
Various mRNA·polypeptide copolymers may be separated 
into their component units based on the specific nature 
of linking between the RNA and the associated 
polypeptide. The mRNA's of selected ligands are 
amplified, as described elsewhere herein, to yield a 
mixture of coding sequences enriched for those that 
encode polypeptide ligands of the desired target, 
including ligands that bind tightly, irreversibly or 
covalently. 
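
The effect of the relative concentrations of target and 
polypeptide on selection stringency, discussed above, 
can be illustrated with a simple equilibrium model. The 
Python sketch below is an idealization and not part of 
the disclosed method: it assumes 1:1 binding, a single 
class of target site, independent ligands and true 
equilibrium. Function names, concentrations and 
dissociation constants are hypothetical and in arbitrary 
units. 

```python
def free_target(total_target, ligands, tol=1e-12):
    """Solve for the free target concentration by bisection.

    ligands is a list of (total_concentration, Kd) pairs. Mass balance:
    total_target = free + sum(L_i * free / (Kd_i + free)).
    """
    def bound_total(t_free):
        return sum(l * t_free / (kd + t_free) for l, kd in ligands)

    lo, hi = 0.0, total_target
    while hi - lo > tol * total_target:
        mid = (lo + hi) / 2
        if mid + bound_total(mid) > total_target:
            hi = mid
        else:
            lo = mid
    return (lo + hi) / 2


def bound_fractions(total_target, ligands):
    """Fraction of each ligand species bound to target at equilibrium."""
    t = free_target(total_target, ligands)
    return [t / (kd + t) for _, kd in ligands]


# Hypothetical pool: a strong binder (Kd 0.01) and a weak binder (Kd 1.0).
ligands = [(1.0, 0.01), (1.0, 1.0)]
limiting = bound_fractions(0.1, ligands)    # polypeptide in excess
excess = bound_fractions(100.0, ligands)    # target in excess
```

With target limiting, the strong binder is retained far 
more efficiently than the weak one. With target in large 
excess, both are almost fully bound and little 
discrimination remains, in line with the qualitative 
behavior described above. 
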

Immunoreactivity of nascent polypeptides on 
ribosome complexes or mRNA·polypeptide copolymers can 
be used to purify the encoding mRNAs. In one 
embodiment, ribosome complexes are purified from cells 
in the presence of inhibitors such as chloramphenicol 
or cycloheximide which stall translational complexes on 
mRNA. Binding of antibodies which recognize the 
epitope of interest followed by binding antibodies 
which recognize those antibodies results in 
immunoprecipitation of the ribosome complexes 
containing the mRNAs which encode the epitope. The 
background of mRNAs which do not encode the epitope of 
interest but are trapped by the immunoprecipitated 
complex can be lowered by using purified IgGs against 
the epitope followed by purification of the 
immunoreactive ribosomes on a protein A column. (IgGs 
are one class of the soluble immunoglobulins which 
compose antisera. Protein A is derived from 
Staphylococcus aureus and has a high affinity for IgGs. 
Protein A binding does not interfere with epitope 
recognition.) 

These procedures for immunoprecipitation to 
partition ribosome complexes or mRNA·polypeptide 
copolymers can be used in a variety of modifications to 
partition the translational complexes in SPERT. One 
such modification is termed "panhandling" (see Figure 
4). A protein is composed of an immunoreactive domain 
for which a known antibody exists, and a separate target 
domain for which one wishes to evolve protein ligands. 
Ribosome complexes or mRNA·polypeptide copolymers which 
interact with the target domain (the "pan") via their 
nascent polypeptides will be immunoprecipitated upon 
binding antibodies which recognize the immunoreactive 
domain (the "handle"). This modification is especially 
useful for developing polypeptide ligands against a 
segment of a fusion protein in which the amino terminus 
is the fragment of a common protein 
(beta-galactosidase, for example) and the 
carboxyl-terminal 
portion is the protein of interest. It will also be 
useful for the development of polypeptide ligands which 
recognize immunoresistant domains of a protein which 
has an immunodominant domain for which polyclonal sera 
is available. Where immunoprecipitation is employed, 
it will be advantageous to discard any ribosome 
complexes or mRNA·polypeptide copolymers that react 
directly with the antibodies, prior to selection. 

Alternative partitioning protocols for 
separating polypeptides bound to targets, particularly 
proteins, are available to the art. For example, 
binding and partitioning can be achieved by 
immunoprecipitation of the test ribosome complex 
mixture or test mRNA·polypeptide copolymer mixture and 
passing the immune complexes through a protein A 
affinity column which retains the immune reactive 
polypeptide-containing complexes on the column. Those 
mRNA's that encode a polypeptide that binds to the 
target antibody will be retained on the column as part 
of the ribosome complex or mRNA·polypeptide copolymer 
and unbound coding mRNA's can be washed from the 
column. 

Interestingly, protein loops may be a powerful 
location for randomization and SPERT-based isolation of 
novel ligands. When inspecting protein structures in 
detail, only secondary structures are predictable; 
those structures include alpha helices and beta sheets 
or multiple strands, and either structure can be formed 
with parallel or anti-parallel peptides. The 
connectors between such secondary structures, called 
loops or hairpins, are related to RNA hairpin loops and 
RNA pseudoknots in that the locations of the ends of 
the loops are set by the secondary structures but the 
exact loop structures are idiosyncratic and dependent 
on the loop primary sequences and contacts with other 
elements of the protein. Loop sequences, when 
randomized and put through SPERT, should provide vast 
structural libraries. Disulfide bonds between 
cysteines represent another means by which to construct 
loops; similarly, zinc fingers and copper or other 
metal "fists" also provide other kinds of loops. 

Effective partitioning can be carried out with 
pure or impure target preparations. In cases where 
target preparations are impure, selectivity can be 
enhanced by strategies that enhance the binding of 
ligands to the desired target, or which specifically 
elute desired ligands or prevent their binding. The 
latter approach is subtractive. A known ligand can 
block binding of any polypeptide that can bind the 
target so that the desired polypeptide is partitioned 
by elution and unwanted polypeptides are retained on 
the column. 

Optionally, chemical or enzymic modifications 
of the polypeptide can be introduced 
post-translationally. The process for making such 
modifications should not disrupt the ribosome complexes 
or mRNA·polypeptide copolymers. An important type of 
post-translational modification is oxidation to form 
disulfide bonds [...] especially advantageous to lock in 
[...] et al. (1990) Science 249:257-263. 
Other forms of post-translational [...] 
polypeptide folding configuration by forming 
coordination complexes with amino acid side chains, 
[...] potential binding activities or [...] 
of polypeptides. Also, reversible [...] 
modifiers during partitioning is only limited by the 
need to maintain stability of ribosome complexes 
[...] medium during partitioning. Alternatively, SPERT 
itself can be used to pre-select polypeptides which 
bind the modifier as a target after which the candidate 
mixture of selected modifier-binding polypeptides can 
be further selected, via SPERT, for binding the 
ultimate target. 

Sequence variation in the test coding mRNA 
mixture can be achieved or increased by mutation. For 
example, a procedure has been described for efficiently 
mutagenizing nucleic acid sequences during PCR 
amplification (Leung et al. 1989). This method or 
functionally equivalent methods can optionally be 
combined with amplification procedures in the present 
invention. 

Alternatively, conventional methods of DNA 
mutagenesis can be incorporated into the nucleic acid 
amplification procedure. Applicable mutagenesis 
procedures include, among others, chemically induced 
mutagenesis and oligonucleotide site-directed 
mutagenesis. 

The starting mRNA mixture is not limited to 
sequences synthesized de novo. In particular, SPERT 
can be used to modify the function of existing 
proteins. A segment of the natural sequence is 
replaced by a corresponding segment of randomized 
sequence in the mRNA that encodes the protein. Since 
many known proteins belong to families having some 
sequences conserved and others varied, the logical 
approach is to replace the variable (or hypervariable) 
regions with randomized sequence, to maximize the 
chance of altering function. The proper choice of 
partitioning conditions, as will be apparent to those 
skilled in the art, results in selection for the 
desired functional variant. In this way, 
modifications, alterations and improvements on known 
proteins can be achieved. 

To proceed to the amplification step when 
utilizing ribosome complexes, coding nucleic acids must 
be released from the target-bound ribosome complexes 
after partitioning. This process must be done without 
chemical degradation of the coding mRNA's and must 
result in amplifiable nucleic acids. In a specific 
embodiment, selected coding RNA molecules are eluted 
from a column using a high ionic strength buffer or 
other eluant capable of disrupting the ligand-target 
bond. Alternatively, the ribosome can be denatured 
such that the mRNA is eluted. The coding mRNA can be 
removed from ribosome complexes or from ribosome 
complex-target pairs by phenol extraction or phenol 
combined with a protein denaturing agent such as 7M 
urea. Although ribosomal RNA is also extracted, 
subsequent amplification is selective 
because the primers used for cDNA synthesis and PCR 
amplification are complementary only to a conserved 
sequence in the mRNA's and not to ribosomal RNA. 

As the translation of randomized mRNAs proceeds
during the SPERT protocol, the growing polypeptide
makes its way from the peptidyl transferase site within
the large ribosome subunit toward the cytoplasmic
solvent. The peptidyl transferase site is an intrinsic
activity of the large ribosome subunit from all
organisms; that site has been defined functionally but
its precise location within the ribosome is unknown.
However, the distance between that site and the
cytoplasmic solvent also is known to be about 30 to 40
amino acids in length.

For optimal effectiveness in SPERT, the random
portion of the nascent polypeptide (whose properties
are selected during the procedure) should be "outside"
the ribosome in order for partitioning of the ribosome
complex to fully utilize the properties of the
randomized polypeptide. A C-terminal trailer sequence
is preferably incorporated into the translated
polypeptide to ensure that the randomized sequence is
fully exposed after translation. From the work of
Smith et al. (PNAS, 75:5922, 1978) and Malkin and Rich
(J. Mol. Biol., 26:329, 1967), for both prokaryotes and
eukaryotes about 30 to 40 amino acid residues remain
within the ribosome during translation. Furthermore,
if the amino-terminus of a growing polypeptide contains
a hydrophobic domain of about 20 amino acid residues, a
nascent polypeptide of about 50 residues has been shown
to be enough to allow the translation complex to
interact with a membrane by hydrophobic interactions
(see Kurzchalia et al., Nature 320:634, 1986). Thus, in
those preferred embodiments of SPERT utilizing ribosome
complexes, the randomized polypeptide will be encoded
by randomized mRNA that is about 30-40 codons (that is,
about 90-120 nucleotides) upstream from the codons at
which the translation complex stalls. It will be
understood that both longer and shorter C-terminal
trailer sequences can be used effectively, and that
SPERT, itself, can be used to determine optimum trailer
length for a given partitioning system. The sequence
of mRNA and encoded polypeptide in the C-terminal
trailer can be designed to have any other desired
function, such as greater stability in the translation
complex, ease of in vitro manipulation, subsequent
polypeptide purification, a reporter activity for
diagnostics, cell entry, etc.
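
The trailer-length reasoning above can be checked with a short Python sketch. It assumes a simplified model that is not spelled out in the text: the C-terminal residues most recently added, up to the tunnel length of 30 to 40 residues, are treated as buried, and everything N-terminal to them as exposed. The function names and the example lengths are hypothetical.

```python
def exposed_residues(chain_length, tunnel_residues=40):
    """Number of N-terminal residues that have emerged from the ribosome."""
    return max(0, chain_length - tunnel_residues)


def random_region_exposed(leader, random_len, trailer, tunnel_residues=40):
    """True if the whole randomized segment lies outside the ribosome
    when translation stalls at the end of the trailer."""
    chain_length = leader + random_len + trailer
    return exposed_residues(chain_length, tunnel_residues) >= leader + random_len


def trailer_nucleotides(trailer_codons):
    """Length in nucleotides of a trailer given in codons (3 nt per codon)."""
    return 3 * trailer_codons


if __name__ == "__main__":
    # A 40-residue trailer fully exposes a 50-residue random region.
    print(random_region_exposed(leader=10, random_len=50, trailer=40))
    # 30 to 40 codons correspond to 90 to 120 nucleotides.
    print(trailer_nucleotides(30), trailer_nucleotides(40))
```

Under this model the randomized segment is fully exposed exactly when the trailer is at least as long as the buried stretch, which is why the trailer length tracks the 30-40 residue estimate.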

Polypeptides selected by SPERT can be produced 
by any peptide synthetic method desired. Chemical 
synthesis can be accomplished since the amino acid 
sequence of the polypeptide is readily obtainable from 
the nucleotide sequence of the coding mRNA. Since cDNA 
from the coding mRNA is available, the polypeptide can 
also be made by expressing the cDNA in a suitable host 
cell. 

SPERT offers, as noted above, an opportunity to
sample as many as 10^18 peptide sequences during a
rigorous experiment with a particular target. As such
SPERT may be compared with in vivo technologies aimed
[illegible] (Proc. Natl. Acad. Sci. [illegible], 1990).
Because the intrinsic depth of those systems is
[illegible] SPERT. Phage display systems allow
10^[exponent illegible] different peptides to be
searched easily, and perhaps 10^11 or so with bigger
volumes and more difficulties. SPERT has value for
looking rigorously through large libraries.

Both SPERT, as defined thus far, and the phage
display systems have a disadvantage in common, at least
formally. In SPERT the peptide of interest is held by
the ribosome, a machine that contains its own proteins
and which is extremely large relative to the peptide of
interest. Similarly, in the phage display systems the
peptide of interest protrudes from a phage particle
which is also relatively extremely large and which
contains its own proteins. Although each of these
systems will yield a peptide of interest with careful
partitioning of the bound peptide from [illegible]
peptides bound to ribosomes or phage capsids, an
improved system would provide the peptide of interest
bound to an encoding nucleic acid (to achieve reverse
translation), free of any other large, proteinaceous
components. As described above, the large phage
particle and the ribosome add limitations to these
systems other than in the partitioning step of the
process. The large entities also severely limit the
number of random peptides that may be practically
generated and tested in the screening process.

SPERT lends itself to such an improvement. In 
an alternate embodiment, this invention contemplates a 
simple and general mechanism by which a non-random 
portion of each peptide within the collection of 
peptides becomes covalently or very tightly attached to 
one end or the other of the mRNA that encodes it to 
form mRNA-polypeptide copolymers.

There are an almost unlimited number of
specific systems that could be employed to generate
mRNA-polypeptide copolymers. Any such system that
allows the ribosomes in the translation mixture to have
a high turnover can be useful. The in vitro reactions
should be as free as possible from RNases. The RNase
problem may also be alleviated by using mutant strains
to lower RNase levels. Alternately, various techniques
familiar to those skilled in the art are available for
making the mRNA nuclease resistant. Additional
criteria for effective systems for forming
mRNA-polypeptide copolymers include the following: 1)
the interactions between the nascent polypeptide and
the mRNA must either occur before the ribosome complex
is disrupted, or at a rate that highly favors the
interaction over dissociation of the proximal species;
2) additional reagents should be relatively small; and
3) the reaction between the nascent polypeptide and the
mRNA should be relatively efficient (i.e., at least
about 5% or greater).

A nonlimiting catalog of methods that can be
employed to generate mRNA-polypeptide copolymers will
generally fall into the following categories: 1)
Adapted post-translational modification systems; 2)
Activation of the 5' end of the mRNA species and the
N-terminus of the peptides to promote relatively simple
organic chemical type reactions between the species; 3)
Attachment of the peptide to the mRNA prior to the
onset of translation; and 4) tRNA crosslinking of the
nascent polypeptide and the mRNA. An example
of each of these systems is described below. The
design of additional embodiments of these general
systems would also be obvious to those skilled in the
art.

Post-Translational Modification Systems

When the collection of mRNAs used
in SPERT is synthesized using T7 RNA polymerase and
guanosine phosphono monothioate for initiation (see
Burgin et al., EMBO J. 9:4111, 1990, for example), the
monothioate is incorporated only at the 5' end
[illegible]; nucleoside triphosphates are the source
[illegible] other functions. [illegible]
the collection could have, for example, biotin or any
one of a number of small reagents affixed to the 5' end
of the RNA. Alternatively, mononucleotides labeled
with biotin could be used to initiate transcription.

The 5' end of the RNA would certainly not preclude
translation by bacterial ribosomes, since those
ribosomes are indifferent to the chemical nature of
the 5' end as long as enough nucleotides are
upstream of the initiating AUG and as long as those
nucleotides contain appropriate sequences to cause
initiation to occur.

According to this embodiment, the codons
downstream from the AUG, also fixed, encode a peptide
that has an extremely high affinity for or can be
covalently bound to the chemical adduct positioned at
the 5' end of each mRNA. Known peptide sequences (such
as avidin) might be used if biotin were the chosen
tag. In one example, a biotin ligase may be used to
make covalent the interaction between the peptide and
the biotin at the end of the mRNA. See Cronan
(Cell, 58:427, 1989); Reed and Cronan (J. Biol. Chem.,
266:11425, 1991), incorporated herein by reference.
Many suitable pairs of chemical adducts and fixed
peptide sequences have been identified, and are known
to those skilled in the art. For example, certain
polypeptides contain lipoylation sites, and the
post-translational modification would utilize the
lipoylation system. See Rucker et al. (FASEB J.,
2:2252-61, 1988); Ali et al. (Mol. Microbiol. 4:943-50,
1990). For other post-translational modification
systems, see PCT Patent Application PCT/US90/02852
(published November 29, 1990, WO 90/14431).

As the nascent peptide emerges from the
ribosome, the most likely 5' adduct to be bound by that
peptide sequence will be the 5' adduct on the mRNA
encoding that exact peptide (which will include, in
this case, randomized peptide sequences downstream of
the fixed peptide adjacent to the initiating
methionine). Again, with respect to biotin and biotin
ligase, the first collisions will be irreversibly
fixed. The length of the 5' end of each mRNA (that is,
how many nucleotides upstream of the ribosome binding
site are needed to enhance the binding reaction in cis)
and the concentration of ribosomes that allow
collisions between the nascent peptide of one ribosome
and the 5' end of the mRNA of another can be determined
easily without undue experimentation. This last point
is clear from a simple calculation. Ribosomes are
about 200 angstroms in diameter, so it may be assumed
that the distance between the nascent, emergent peptide
(from the large ribosome subunit) and the emergent 5'
adduct of the mRNA (from the small ribosome subunit)
will never be more than 500 angstroms and could
be much less. The calculated concentration of the
nascent peptide with respect to its own 5' adduct in
cis is higher than 3 micromolar for a worst case
scenario, and could be more than 100 times higher.
Since the ribosome concentration in many cell-free 
translation experiments is sub-micromolar, it is not
difficult to preclude scrambled binding between nascent
peptides and 5' mRNA adducts on other ribosomes.
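
The worst-case figure quoted above can be reproduced with a short Python sketch. The geometric model is an assumption made here for illustration and is not stated in the text: the nascent peptide is treated as a single molecule confined to a sphere whose radius equals the maximum separation from its own 5' adduct. The function name is hypothetical.

```python
import math

AVOGADRO = 6.02214076e23  # molecules per mole


def effective_molarity(max_separation_angstrom):
    """Molar concentration of one molecule confined to a sphere of the given radius."""
    radius_dm = max_separation_angstrom * 1e-9  # 1 angstrom = 1e-10 m = 1e-9 dm
    volume_litres = (4.0 / 3.0) * math.pi * radius_dm ** 3  # 1 dm^3 = 1 litre
    return 1.0 / (AVOGADRO * volume_litres)


if __name__ == "__main__":
    worst_case = effective_molarity(500)   # 500 angstrom separation
    closer = effective_molarity(100)       # a much shorter separation
    print(f"{worst_case * 1e6:.2f} micromolar")
    print(f"{closer / worst_case:.0f}-fold higher at 100 angstroms")
```

With a 500 angstrom radius this gives about 3.2 micromolar, consistent with the "higher than 3 micromolar" estimate; because the concentration scales with the inverse cube of the radius, a 100 angstrom separation is 125 times more concentrated.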

As translation ends, after mRNA-polypeptide
copolymer formation and prior to enrichment for
peptides that partition with a target, the cell-free
reaction may be treated with puromycin and EDTA to
disassociate the ribosomal subunits. ATA, poly U, or
other non-amplifiable RNAs may be added to prevent
rebinding of mRNAs to the ribosomes. Size
fractionation may then be used to enrich for small
material, and/or high speed centrifugation would
eliminate the ribosomes and many of the proteins from
the cell-free system from the mRNA-polypeptide
copolymer (such copolymers may be truly covalent or
merely effective copolymers when very high affinities
are used for the linkage). More complete purifications
of the copolymer prior to partitioning with target are
obvious. For example, hybridization to column-bound
complementary DNA (to one end of the mRNA) and
subsequent elution would give full purification.
Similarly, the fixed peptide could include an
additional sequence for this purification; a small
epitope would do, thus allowing purification of the
mRNA-polypeptide copolymer with antibodies against that
epitope.

The mRNA-polypeptide copolymer is partitioned 
as in the ribosome complex examples, and the bound mRNA 
amplified via cDNA synthesis and PCR, as always 
extending the cDNA to create again the T7 promoter 
sequence for the next round of SPERT. The peptide 
attached to the 5' end of the mRNA may cause the 3' end 
of the cDNA to be a bit shorter than in the absence of 
peptide, but PCR easily accomplishes the full 
restructuring of the DNA for subsequent transcription, 
in this case initiated once again by phosphono
monothioate nucleotide for adding the small organic
molecule needed for linkage.

In this alternate embodiment of SPERT, the
peptide is directly linked to the encoding nucleic acid
and is partitioned to target (or reacted in any other
way described for SPERT) with only the encoding nucleic
acid available (along with the peptide [illegible]) to
that target. The very large ribosome or phage capsid
no longer obscures the partitioning reaction in any
way.

The post-translational modification systems
described above generally require an enzyme to
facilitate the reaction between the nascent peptide and
mRNA. According to this embodiment, the modifying
enzyme is eliminated, and relatively simple chemical
reactions are relied on to form the copolymers.

In one embodiment of this system, sulfur-halide
chemistry is employed. Sulfur may be incorporated on
the 5' end of the mRNA using the T7 RNA polymerase and
monothioate for initiation as described above. A halide
can be incorporated on the N-terminus of the peptide by
use of N-haloacetyl-met-tRNAf (Pellegrini et al.,
Proc. Natl. Acad. Sci. USA, 69:837-41, 1972; Sopori
et al., Biochemistry, 13:5432-39, 1976). This
combination would result in spontaneous nucleophilic
substitution to form a thioether linkage between the
nascent polypeptide and the mRNA. In order to avoid
reaction of the halo-acetyl group with DTT in the
translation mixture, or with cysteine residues in
ribosomal proteins, it is preferred that the
chloroacetyl functionality be utilized.

In a further embodiment of this process, it may
be desirable to accelerate the reaction between the
nascent polypeptide and the mRNA by introducing a
"chaperone" RNA sequence. The chaperone acts as a
catalyst to facilitate the nucleophilic substitution
[illegible]. The chaperone may be selected easily
[illegible] SELEX technology, [illegible]
by placing a stretch of random noncoding RNA adjacent
the 5' GMPS mRNA, and collecting those sequences
[illegible] or reacting with the halo-acetyl N-terminal
peptide. This [illegible] present a probable nucleic
acid [illegible] catalyst to facilitate the reaction.

Attachment of mRNA to Peptide

In one embodiment of the formation of
mRNA-polypeptide copolymers, the mRNA is attached to
the nascent polypeptide before translation is
initiated. In one embodiment, this pre-translation
attachment would occur by attaching the
mRNA to the amine group of methionine
via a covalent linker. As translation proceeds, the
initiating methionine is already attached to the mRNA
at the initial amino acid sequence.

Crosslinking of Message and Peptide

According to this embodiment, a covalent
linkage is created between peptidyl-tRNA and mRNA. An
embodiment of this system is based on studies
of the photoreaction between the Y base of yeast
tRNA and mRNA. See Matzke et al. (Proc. Natl.
Acad. Sci. USA, 77:5110-14, 1980). See also Steiner
et al. (Nucl. Acids Res. 12:8181-91, 1984;
demonstration that tRNA can undergo peptidyl transfer
and translocate normally from A-site to P-site after
being crosslinked to mRNA); Paszyc et al. (Nucl. Acids
Res. 6:385-97, 1979). A nonsense suppressor containing
the Y base may be used that will crosslink to the
message at the end of peptide synthesis, resulting in a
peptide-tRNA-mRNA covalent complex. The peptide-tRNA
linkage could be made into a stable amide linkage by
making the 3' terminus of the tRNA
2'-deoxy-3'-amino-adenosine. See Fraser et al. (Meth.
Enzymol. 49:135-45).

Continuous irradiation of this system during
translation would yield photocrosslinked
mRNA-polypeptide copolymers. An advantage of this
embodiment is that there would not be any constraints
on the peptide or message.

It is an important and unexpected aspect of the 
present invention that the methods described herein can
be employed to identify, isolate or produce polypeptide 
molecules which will bind specifically to any desired 
target molecule. Thus, the present methods can be 
employed to produce polypeptides specific for binding 
to a particular target. 

Proteins contain within their primary sequence 
the information required to form an extraordinary 
variety of three dimensional shapes as is well known in
the art. From this variety of potential shapes, along 
with the charge and/or hydrophobic qualities of amino 
acids, comes the potential for protein functions that 
are used in the biosphere. Proteins provide catalysis 
when embodied as enzymes; proteins can provide stable 
biological structures, for example, when used to 
construct spores, membranes, or viruses; and proteins 
can provide binding to a variety of targets, with 
appropriate affinities and kinetic parameters to allow 

Nevertheless, this vast potential in chemical
activities, including the extreme potential inherent in
the mammalian immune system, has actually been explored
rather narrowly by organisms. This fact can be noted
with a simple calculation. If the average length of a
protein is 300 amino acids, and if there are twenty
natural amino acids used to construct modern proteins,
the number of possible sequences of proteins of average
size is 20^300 or ~10^390. Estimates of the number of
particles in the universe are in the range
10^[exponent illegible], while
estimates for the number of proteins ever explored in
the entire history of the earth are in the range
10^[exponent illegible].
The tiny fraction of so-called sequence space that has
been explored by biology is a result of evolutionary
history and the relatively short age of the earth. The
present invention provides the means to explore protein
sequence space without historical and evolutionary
limitations, while continuing to respect limitations
established by the number of particles in the universe.
The invention provides the means to identify and
isolate polypeptide ligands with any desired quality
from vast mixtures of protein sequences comprised
largely of individual entities that have never before
existed. The amino acid sequence of the selected
ligand can be learned from the nucleotide sequence of
its encoding mRNA, making tedious amino acid sequence
determination unnecessary.
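
The size of sequence space in the calculation above is easy to verify. The Python sketch below is illustrative only; the function name is hypothetical, and the sole inputs are the figures given in the text (twenty amino acids, 300 residues).

```python
import math


def log10_sequence_count(length, alphabet_size=20):
    """Base-10 logarithm of the number of sequences of the given length."""
    return length * math.log10(alphabet_size)


if __name__ == "__main__":
    # 20^300 expressed as a power of ten
    print(f"20^300 is about 10^{log10_sequence_count(300):.1f}")
```

Working with the logarithm avoids printing a 391-digit integer, although Python's arbitrary-precision integers can also evaluate 20**300 exactly.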

Even where the binding functions selected by
SPERT have known naturally occurring counterparts,
there is no reason to expect that the polypeptides
selected by SPERT will resemble naturally-occurring
proteins or peptides having similar function. In most
instances, SPERT-selected polypeptides will be smaller
than naturally-occurring proteins, typically having a
size of from 4-100 amino acids, preferably from 4-50
amino acids selected from randomized sequence of the
same length, and also having a C-terminal trailer of
about 30-40 amino acids and, optionally, an N-terminal
leader of about 10 amino acids, for a total length of
about 100 amino acids, corresponding to a molecular
weight of about 11 kd. This is smaller than most
enzymes and all antibodies; for comparison, IgG has a
molecular weight of about 150 kd. Furthermore, many
polypeptide ligands of the invention will function when
freed of N- and C-terminal trailers. Therefore, the
final product can be as small as 4-50 amino acids. The
polypeptides of the invention are non-naturally- 
occurring, and typically differ in amino acid sequence 
and molecular size from naturally-occurring proteins. 
That portion of the amino acid sequence arising from 
randomized coding is designated the "binding segment" 
herein. The binding segment can be of any length, 
conveniently ranging from about 4-100 amino acids in 
length, preferably from about 15-50 amino acids in 
length. Additionally, given the vastness of sequence 
space, it is expected that most polypeptide ligands of 
the invention will have less than 50% homology with 
natural proteins, and preferably less than 30% amino 
acid homology with natural proteins. 
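
The molecular weight quoted above for a polypeptide of about 100 amino acids follows from a common rule of thumb. The Python sketch below assumes an average residue mass of 110 daltons; that value is an assumption introduced here and does not come from the text, and the function name is hypothetical.

```python
AVERAGE_RESIDUE_MASS_DA = 110.0  # rule-of-thumb average; an assumption, not from the text


def approx_mass_kd(n_residues):
    """Approximate polypeptide mass in kilodaltons from its length in residues."""
    return n_residues * AVERAGE_RESIDUE_MASS_DA / 1000.0


if __name__ == "__main__":
    full_length = approx_mass_kd(100)  # leader + binding segment + trailer
    print(f"100 residues: about {full_length:.0f} kd")
    print(f"IgG (about 150 kd) is roughly {150 / full_length:.0f} times larger")
```

The same estimate applied to a freed binding segment of 4-50 residues gives a product well under 6 kd.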

A polypeptide ligand of the invention in a 
number of ways functionally resembles an antibody. 
Polypeptide ligands which have binding functions 
similar to those of antibodies can be isolated by the 
methods of the present invention. Such polypeptides 
are generally useful in applications in which 
polyclonal or monoclonal antibodies have found 
application. However, the polypeptide ligands of the 
invention have significant advantages over antibodies: 
they can be selected for any desired affinity, 
including higher affinities than are obtainable with 
antibodies, they can be selected to bind at any desired 
epitope or combination of epitopes, including binding
sites not recognized by antibodies, they can be larger 
or smaller and have different solubility properties 
than antibodies and they can be generated by techniques 
that operate entirely in vitro , without the need for 
live animals or cell culture techniques. Applications 
of polypeptide ligands include the specific, 
qualitative or quantitative detection of target 
molecules from any source; purification of target
molecules based on their specific binding to the
polypeptide; and various therapeutic methods which rely 
on the specific direction of a toxin or other 
therapeutic agent to a specific target site. Target 
molecules are preferably proteins, but can also include 
among others carbohydrates, nucleic acids, 
peptidoglycans and a variety of small molecules. As 
with conventional antibodies, polypeptide ligands can 
be employed to target biological structures, such as 
cell surfaces or viruses, through specific interaction 
with a molecule that is an integral part of that 
biological structure. Polypeptide ligands are 
advantageous in that they are not limited by self 
tolerance, as are conventional antibodies. Also, as 
noted, polypeptide ligands of the invention do not 
require animals or cell cultures for synthesis or 
production, since SPERT is a wholly in vitro process. 
The methods of the present invention related to the use 
of polypeptide ligands can generate novel polypeptides 
that bind targets for which other proteinaceous ligands 
are known. For example, a number of proteins are known 
to function via binding to nucleic acid sequences, such 
as regulatory proteins which bind to nucleic acid 
operator sequences. The known ability of certain 
nucleic acid binding proteins to bind to their natural 
sites, for example, has been employed in the detection, 
quantitation, isolation and purification of such 
proteins. The methods of the present invention related 
to the use of polypeptide ligands can be used to make 
novel nucleic acid binding ligands having affinity for 
nucleic acid sequences which are known to bind proteins 
and to nucleic acid sequences not known to bind 
proteins. Novel, non-naturally-occurring polypeptides 
which bind to the same binding sites of nucleic acids 
can be developed using SPERT. As will be discussed 
below, certain polypeptides isolatable by SPERT can 
also be employed to affect the function of (for example,
inhibit, enhance or activate) specific target molecules
or structures. Specifically, polypeptide ligands can 
be employed to inhibit, enhance or activate the 
function of proteins and of nucleic acids. 

It is a second important aspect of the present 
invention that the methods described herein can be 
employed to identify, isolate or produce polypeptide 
molecules which will bind specifically to a particular 
target molecule and affect the function of that 
molecule. In this aspect, the target molecules are
again preferably proteins or nucleic acids, but can 
also include, among others, carbohydrates and various 
small molecules to which specific polypeptide binding 
can be achieved. Polypeptide ligands that bind to 
small molecules can affect their function by 
sequestering them or by preventing them from 
interacting with their natural ligands. For example, 
the activity of an enzyme can be affected by a 
polypeptide ligand that binds the enzyme's substrate.
Polypeptide ligands of small molecules are particularly 
useful as reagents for diagnostic tests, or other 
quantitative assays. For example, the presence of 
controlled substances, bound metabolites or abnormal 
quantities of normal metabolites can be detected and 
measured using polypeptide ligands of the invention.
Antibodies to polypeptide ligands can be used to 
precipitate or bind ligand-target pairs to a solid 
phase matrix in a diagnostic assay. A polypeptide 
ligand having catalytic activity can affect the 
function of a small molecule by catalyzing a chemical
change in the target. The range of possible catalytic 
activities is at least as broad as that displayed by 
natural proteins. 

The strategy of selecting a ligand for a 
transition state analog of a desired reaction is one 
method by which catalytic polypeptide ligands can be 
selected. Polypeptide ligands with high affinity for
transition-state analogues are likely to have enzymatic
activity, as has been demonstrated for monoclonal
antibodies directed against transition-state analogues.
These antibodies have exhibited a wide range of
catalytic activities, including acyl-transfer reactions
[Pollack et al., Science 234:1570 (1986); Tramontano et
al., Science 234:1570 (1986); Jacobs et al., J. Am.
Chem. Soc. 109:2174 (1987); Napper et al., Science
237:1041 (1987); Janda et al., Science (1988);
Schultz, P.G., Science 240:426 (1988); Benkovic et al.,
Proc. Natl. Acad. Sci. 85:5355 (1988)], carbon-carbon
bond formation [Jackson et al., J. Am. Chem. Soc.
110:4841 (1988); Hilvert and Nared, J. Am. Chem. Soc.
110:5593 (1988)], carbon-carbon bond cleaving reactions
[Cochran et al., J. Am. Chem. Soc. 110:7888 (1988)],
peptide cleavage [Iverson and Lerner, Science 243:1184
(1989)], and ester bond hydrolysis [Janda et al.,
Science 244:437 (1989)]. The number of polypeptide
sequences and structures that can be explored by SPERT
far exceeds that available in the immune system.

Enzymes are evolved using SPERT and starting 
randomized sequences corresponding to about 50 amino
acids, as in Example 3. Enzymatic polypeptide ligands
of small size are entirely unanticipated by the present 
understanding of enzymology; enzymes are always much 
larger in nature than the scientist expects. The 
specific transition state analogues used are drawn from 
the literature cited above. Among the reactions probed 
by the monoclonal antibody-enzymes are some which lead 
to the breakdown of toxic waste products, including 
chemicals with chlorine-carbon bonds and carbon-carbon 
bonds in ring structures like those found in benzene 
and polychlorinated phenols. 

The binding selection methods of the present
invention can be combined with secondary selection or
screening to identify ligands capable of modifying
target molecule function upon binding. The large
population of variant amino acid sequences that can be
tested by SPERT enhances the probability that
polypeptide sequences can be found that have a desired
binding capability and that function to modify target
molecule activity. The methods of the present
invention are useful for selecting polypeptide ligands
which can selectively affect function of any target
protein. The methods described herein can be employed
to isolate or produce polypeptide ligands which bind to
and modify the function of any protein or nucleic acid.
It is contemplated that the method of the present
invention can be employed to identify, isolate or
produce polypeptide molecules which will affect
catalytic activity of target enzymes, i.e., inhibit
catalysis or modify substrate binding; affect the
functionality of protein receptors, i.e., inhibit
binding to receptors or modify the specificity of
binding to receptors; affect the formation of protein
multimers, i.e., disrupt quaternary structure of
protein subunits; and modify transport properties of
protein, i.e., disrupt transport of small molecules or
ions by proteins.

Secondary selection methods that can be 
combined with SPERT include among others selections or 
screens for enzyme inhibition, alteration of substrate 
binding, loss of functionality, disruption of 
structure, etc. Those of ordinary skill in the art are 
able to select among various alternatives those 
selection or screening methods that are compatible with 
the methods described herein. 

An embodiment of the present invention, which 
is particularly useful for identifying or isolating 
polypeptides which bind to a particular functional or 
active site in a protein, or other target molecule, 
employs a molecule known, or selected, for binding to a 
desired site within the target protein to direct the 
selection/amplification process to a subset of
polypeptide ligands that bind at or near the desired
site within the target molecule. In a simple example,
a polypeptide sequence known to bind to a desired site
in a target molecule is incorporated near the
randomized region of all polypeptides being tested for
binding. SPERT is then used to select those variants,
all of which will contain the known binding sequence,
which bind most strongly to the target molecule. A
longer binding sequence, which is anticipated to either
bind more strongly to the target molecule or more
specifically to the target, can thus be selected. The
longer binding sequence can then be introduced near the
randomized region of the polypeptide test mixture and
the selection/amplification steps repeated to select an
even longer binding sequence. Iteration of these steps
(i.e., incorporation of selected sequence into test
mixtures followed by selection/amplification for
improved or more specific binding) can be repeated
until a desired level of binding strength or
specificity is achieved. This iterative "walking"
procedure allows the selection of polypeptides highly
specific for a particular target molecule or site
within a target molecule. Another embodiment of such
an iterative "walking" procedure employs an "anchor"
molecule which is not necessarily a polypeptide or
amino acid. In this embodiment a molecule which binds
to a desired target, for example a substrate or
inhibitor of a target enzyme, is chemically modified
such that it can be covalently linked to a bridge
molecule which in turn is known to be bound to an
oligopeptide of known sequence. The bridge molecule
covalently linked to the "anchor" molecule that binds
to the target also binds to the target molecule. The
sequence encoding the known bridge-binding oligopeptide
is incorporated near the randomized region of the test
nucleic acid mixture. SPERT is then performed to
select for those polypeptide sequences that bind most
strongly to the target molecule/bridge/anchor complex.
The iterative walking procedure can then be employed to 
select or produce longer and longer polypeptide 
molecules with enhanced strength of binding or 
specificity of binding to the target. The use of the 
"anchor" procedure is expected to allow more rapid 
isolation of polypeptide ligands that bind at or near a 
desired site within a target molecule. In particular, 
it is expected that the "anchor" method in combination 
with iterative "walking" procedures will result in 
polypeptides which are highly specific inhibitors of 
protein function. 

In accordance with the teachings of copending 
applications Serial No. 07/536,428 and Serial No. 
07/714,131, the translated mRNA of a ribosome complex 
or mRNA-polypeptide copolymer is, in principle, capable 
of binding to target molecules and of being partitioned 
concurrently with nascent polypeptides. In particular, 
where partitioning is accomplished by affinity 
chromatography, the selected ligand can be an RNA, 
rather than a polypeptide. Binding of mRNA can be 
differentiated from polypeptide binding once the ligand 
has been selected and both the selected polypeptide and 
its coding mRNA are available for independent direct 
binding studies where the two are not part of the same 
ribosome complex. Comparative studies of the relative 
frequency of RNA ligands and polypeptide ligands 
selected by SPERT are of fundamental biological 
importance to understanding the specialization of 
function that currently exists in living cells. This 
direct comparison between RNA and peptide during the 
SPERT cycles may prove to be surprisingly robust. As 
described in the SELEX applications, large numbers of 
protein targets will yield a tight-binding RNA ligand. 
For a given target it can not be predicted whether RNA 
or peptide will give more useful ligand solutions, and 
thus SPERT can be seen as an improvement to the SELEX
application because when RNA yields the best ligand
solutions the data will lead to that conclusion
immediately. For example, the RNA ligand solutions
will be indifferent to the reading frame in which the
conserved sequence or structure is found, while the
peptide solutions will force the RNA solutions to have
a common sequence in the same reading frame.
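
The reading-frame criterion above can be illustrated with a minimal Python sketch. It assumes the selected clones have already been sequenced and that a conserved motif has been identified; the function names, the example sequences and the motif are hypothetical and serve only to show the bookkeeping. A motif found in a single frame across clones is consistent with a peptide ligand, while a motif scattered over several frames is consistent with an RNA ligand.

```python
def motif_frames(clones, motif, orf_start=0):
    """Collect the reading-frame offsets (0, 1 or 2) at which `motif`
    occurs in any clone, measured from the first base of the coding region."""
    frames = set()
    for seq in clones:
        pos = seq.find(motif)
        while pos != -1:
            frames.add((pos - orf_start) % 3)
            pos = seq.find(motif, pos + 1)
    return frames


def likely_ligand_type(clones, motif, orf_start=0):
    """Heuristic label only: one shared frame suggests selection acted on
    the encoded peptide; mixed frames suggest selection acted on the RNA."""
    frames = motif_frames(clones, motif, orf_start)
    if not frames:
        return "motif absent"
    if len(frames) == 1:
        return "peptide-like (single frame)"
    return "RNA-like (mixed frames)"


# Hypothetical selected clones (coding strand, starting at the start codon).
same_frame = ["AUGGCUGGAUCCAAA", "AUGAAAGGAUCCGCU"]
mixed_frame = ["AUGGCUGGAUCCAAA", "AUGCGGAUCCAAAGC"]
print(likely_ligand_type(same_frame, "GGAUCC"))
print(likely_ligand_type(mixed_frame, "GGAUCC"))
```

The label is only a first indication; as stated above, direct binding studies on the separated polypeptide and mRNA remain the decisive test.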

The polypeptides of the invention can be
selected for other properties in addition to binding.
For example, during partitioning, conditions of the
desired working environment of the end product can be
included as a selection criterion. If a polypeptide
which is stable in the presence of a certain protease
is desired, that protease can be part of the buffer
medium used during partitioning. As will be
understood, when utilizing ribosome complexes,
conditions which disrupt ribosome complexes should be
avoided. Other desired properties can be incorporated
directly into the polypeptide sequence, as will be
understood by those skilled in the art. For example,
membrane affinity can be included as a property either
by employing an N- or C-terminal trailer having high
hydrophobicity, or by biasing the randomized coding to
favor the amino acids with lipophilic side chains.

The coding nucleic acid concomitantly selected
by partitioning nascent polypeptides as described is
useful in its own right to transform host cells or
organisms. The transformed organism is then useful
for, e.g., fermentation production of the selected
polypeptide. A transgenic organism can be rendered
resistant to a virus infection, for example, by causing
in vivo synthesis of a polypeptide ligand of the viral
nucleic acid or a key viral protein. Any
functionality contributed by a polypeptide ligand of
the invention can be bestowed on a suitable host
organism. Methods known in the art can be used to
combine the coding region with a promoter and a
polyadenylation signal functional in the intended host,
followed by incorporation into a suitable vector for 
transformation, all as known and understood in the art. 

EXAMPLES

The techniques and methods used in the ensuing 
examples are published and known in the art. Together 
with adaptations and modifications known to those of 
ordinary skill in the art, the procedures not 
specifically referenced herein are available from known 
reference works. These include, in addition to Sambrook et al.
(1989) supra, Genetic Engineering, Plenum Press, New
York (1979); Weir (ed.) (1986) Handbook of
Experimental Immunology, in Four Volumes, 4th Ed.,
Blackwell Scientific Publications, Oxford; and the
multivolume Methods in Enzymology published by Academic
Press, New York. Polymerase chain reaction techniques
are described in PCR Protocols (Michael A. Innis et
al., eds.) (1990) Academic Press, Inc.

Throughout examples 1-9, reference is made to 
Tables 1 and 2. Table 1 lists oligonucleotide 
sequences used for preparing mRNA candidates. Table 2 
lists the same sequences together with explanatory 
notes showing functional domains. Sequences in
capitals are chemically synthesized; sequences in lower
case letters are complementary sequences made
enzymatically by DNA polymerase. The Examples could be
adapted by those of ordinary skill in the art to
generate mRNA-polypeptide copolymers as taught herein
without undue experimentation.

Example 1. Direct Immunoprecipitation of Ribosome
Complexes: Polypeptide Ligands Directed
Toward Immunoglobulin Molecules.

The method of the invention is used to select
novel polypeptides that bind the antibody of an epitope
commonly recognized by the antisera from
mice which are the F1 progeny of a cross of NZB and NZW
parents (Portanova et al., J. Immunol. 144:4633
(1990)). The known epitope consists of about 10
contiguous amino acids at the amino terminus of the
histone H2B protein. To make mRNA encoding candidate
polypeptides, a 5' fixed sequence composed of a T7
promoter sequence and a ribosome binding site which is
recognized by both prokaryotic and eukaryotic
ribosomes, terminating in a restriction endonuclease
site, is synthesized and cloned using oligonucleotides
having the sequences shown as sequence 1 in Tables 1
and 2 and in Figure 8. A 3' fixed sequence is placed
into a restriction site to provide an mRNA encoding the
C-terminal trailer sequence of ca. 100 nucleotides
lacking stop codons (for ca. 30-35 amino acids) shown
as sequence 3 in Tables 1 and 2 and Figure 8. In
addition, as shown in Figure 1, a 3' primer annealing
site (sequence 3) is provided so that cDNA synthesis
can be accomplished on the mRNA recovered from
partitioned ribosome complexes.

The randomized polypeptide insertion site is
bounded by restriction endonuclease recognition sites,
in this example EcoRI and PstI. A single-stranded
oligonucleotide is synthesized with a randomized
sequence of 45 nucleotides (corresponding to 15 codons)
bounded by specific sequences that include those two
restriction endonuclease sites (Sequence 4a).
Synthesis of randomized oligonucleotides is carried out
using an Applied Biosystems DNA synthesizer provided
with a reactant mixture for each nucleotide position.
To partially compensate for the amino acid sequence
bias inherent in the redundancy of the genetic code,
the reaction mixtures contain, on a mole percent basis,
the following composition of bases for each codon:
First position, C-20%, T, A, and G-30% each; Second
position, C-15%, A-35%, T and G-25% each; Third
position, T, C, A and G-25% each. Using a nucleic acid
primer that is complementary to the fixed 3' end of the
randomized oligonucleotide, randomized double-stranded
DNA is created with the action of DNA polymerase. The
products are digested with the two restriction
endonucleases and ligated between the 5' fixed sequence
and the 3' fixed sequence discussed above. In vitro
transcription of these ligated templates using T7 RNA
polymerase (Bethesda Research Laboratories,
Gaithersburg, MD) provides mRNA templates for in vitro
translation. A rabbit reticulocyte lysate system (BRL)
is used to translate the mRNA templates in vitro, using
standard reaction conditions. Such translation of
these transcripts results in a variety of ribosomal
complexes (mRNA-nascent polypeptide-tRNA-ribosomes)
that are identical except for the randomized region of
the nascent polypeptide.
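
The effect of a biased nucleotide mixture on the encoded residues can be estimated with a short Python sketch. It uses the standard genetic code and the mole-percent figures quoted above. The first-position figures as printed sum to 110, so the sketch renormalizes each position to 1; that renormalization, like the function names, is an assumption of this illustration and not part of the described procedure.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, codons ordered TTT, TTC, TTA, TTG, TCT, ..., GGG.
AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {"".join(c): aa
               for c, aa in zip(product(BASES, repeat=3), AMINO)}


def normalize(weights):
    """Scale base weights so that they sum to 1."""
    total = sum(weights.values())
    return {base: w / total for base, w in weights.items()}


# Mix quoted in the text, one dictionary per codon position.
# ASSUMPTION: each position is renormalized because the printed
# first-position percentages add up to 110.
BIASED = [
    normalize({"C": 20, "T": 30, "A": 30, "G": 30}),
    normalize({"C": 15, "A": 35, "T": 25, "G": 25}),
    normalize({"T": 25, "C": 25, "A": 25, "G": 25}),
]
UNIFORM = [normalize({b: 1 for b in BASES})] * 3


def residue_frequencies(mix):
    """Expected frequency of each amino acid ('*' = stop) per random codon."""
    freq = {}
    for codon, aa in CODON_TABLE.items():
        p = mix[0][codon[0]] * mix[1][codon[1]] * mix[2][codon[2]]
        freq[aa] = freq.get(aa, 0.0) + p
    return freq


def stop_free_probability(mix, n_codons):
    """Probability that n_codons independent random codons contain no stop."""
    return (1.0 - residue_frequencies(mix)["*"]) ** n_codons


if __name__ == "__main__":
    for name, mix in (("uniform", UNIFORM), ("biased", BIASED)):
        freq = residue_frequencies(mix)
        print(name, "stop per codon: %.4f" % freq["*"],
              "stop-free 15-codon inserts: %.3f" % stop_free_probability(mix, 15))
```

Comparing the two mixtures residue by residue shows which amino acids the bias favors and what fraction of 15-codon inserts can be expected to read through without a stop codon.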

Antibodies (IgGs) (Portanova et al., supra)
which recognize the H2B histone epitope are added to
the in vitro translation mixture. Immunoprecipitation
of the immunoreactive ribosome complexes partitions the
mRNA species that encode the highest-affinity
polypeptide ligands in the population (see Figures 3
and 4). Immunoprecipitated complexes are separated by
low speed centrifugation. cDNA is synthesized from
these mRNAs and is used via PCR to provide template for
further cycles of transcription, translation,
immunoselection and cDNA synthesis.
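
Why repeated cycles are needed can be seen from a toy enrichment model, sketched below in Python. It assumes that each partitioning step recovers a fixed fraction of complexes carrying a binding polypeptide and a smaller fixed fraction of non-binding background, and that amplification preserves the resulting proportions. The recovery values and the starting frequency are hypothetical numbers chosen only for illustration.

```python
def enrich(binder_fraction, recovery_binder, recovery_background, rounds):
    """Return the binder fraction before selection and after each round.

    Each round keeps `recovery_binder` of the binders and
    `recovery_background` of everything else, then renormalizes
    (amplification is assumed not to change the proportions).
    """
    history = [binder_fraction]
    f = binder_fraction
    for _ in range(rounds):
        kept_binders = f * recovery_binder
        kept_background = (1.0 - f) * recovery_background
        f = kept_binders / (kept_binders + kept_background)
        history.append(f)
    return history


if __name__ == "__main__":
    # Hypothetical: one binder per million, 50% recovery of binders,
    # 0.5% carry-over of non-binders per round.
    for cycle, fraction in enumerate(enrich(1e-6, 0.5, 0.005, 5)):
        print("after cycle %d: binder fraction %.6f" % (cycle, fraction))
```

In this model the odds of picking a binder are multiplied by the ratio of the two recoveries in every cycle, so a rare binder dominates the pool only after several cycles.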

Clones are isolated as described in Application
07/536,428, June 11, 1990, incorporated herein by
reference. The individual polypeptide products are
over-produced, purified and tested, using standard
techniques, for reactivity to the anti-H2B histone
antibodies. In addition, the polypeptide ligands are
challenged competitively with authentic histone
H2B-derived epitopes to discover which polypeptide ligands
bind to the same portion of the antibodies as the true
epitope, that is, the H2B epitope binding site.

Example 2. Diagnostics using the polypeptide ligands
of Example 1: An assay for anti-H2B
antibodies in the progeny of NZB X NZW
mice.

Auto-immune diseases result from the
generation of an inappropriate antibody molecule with
reactivity toward a normal protein [...]
reagent ligand for the diagnostic recognition of the
auto-antibody that recognizes the histone H2B epitope.

As in Example 1, ribosome complexes are treated
with the auto-antibody to partition reactive
polypeptides from non-reactive polypeptides resident
[...]. Nascent polypeptides, in [...]
auto-antibodies are used to precipitate the [...]
not resemble in detail the epitope identified as the
portion of the target that reacts with the antibody.
Auto-immune diseases are triggered by unknown antigens,
which are not necessarily the same as the
target epitope identified as the interactive species
in the clinical stage of the auto-immune disease.
For example, a virus infection may trigger an immune
reaction that yields a class of antibodies that cross- 
react with a normal cellular target. Such antibodies 
may bind more avidly to the original, stimulatory, 
viral antigen than to the epitope on the cellular 
target. As another example, the epitope on the 
cellular target may not take full advantage of the 
binding site on the antibody. 

The polypeptide ligand is used diagnostically 
to measure the quantity of circulating auto-antibody, 
using, e.g., an ELISA assay. The technology is 
available to one skilled in the art, without undue 
experimentation. As another example, the fixed portion 
of the polypeptide ligand is used as the reporter 
substance when the polypeptide ligand interacts with 
the circulating auto-antibody. With a fixed carboxy- 
terminus of beta-galactosidase or alkaline phosphatase, 
serum protein samples attached to plastic plates are 
assayed directly for the anti-H2B antibody by 
"staining" with the polypeptide ligand covalently fused 
(by recombinant DNA techniques) to either reporter 
enzyme. 

Example 3. Indirect Immunoprecipitation: Polypeptide
ligands directed toward domains of any
protein.

Immunization of animals with antigens, whether
crudely prepared or purified, often results in immune
responses directed at a subset of the available
epitopes in that antigen. The polyclonal sera may
react largely with a single protein domain in that 
antigen. Similarly, when researchers attempt to raise 
antibodies against fusion proteins, often the well- 
known fusion partner is immunodominant over the new 
protein portion of the fusion. 

Antibodies aimed at a protein target (but that
do not recognize the portion of the target that one
wishes to use as the target in SPERT) allow INDIRECT
immunoprecipitation of ribosome complexes. That is,
immunoprecipitation is a useful partitioning step when
antibodies are aimed at domains in the target that are
different from those domains pre-selected for
SPERT-based ligand evolution. This protocol is sometimes
called "panhandling", and can yield high-affinity
polypeptide ligands for target domains that are weakly
immunogenic.

SPERT is performed using variable material 
prepared as in Example 1 except that the randomized 
mRNA regions are now set to yield about 50 amino acids 
in the solvent-exposed nascent polypeptide. Biased 
randomization is done so that chain termination codons 
are not likely over the 150 randomized nucleotides; in
addition, cell-free translation is performed in the 
presence of so-called suppressor tRNAs so that 
translation continues to the desired portion of the 
mRNAs. 

The population of ribosome complexes is pre- 
treated with the antisera aimed at the target protein, 
but in the absence of that target protein. The pre- 
treatment is designed to eliminate any nascent 
polypeptides that react directly with the antibodies, 
as in Example 1. The target protein is then added to 
the ribosome complexes, along with antibodies aimed at 
the target protein. Partitioning occurs as the
ribosome complexes that interact with the target are
immunoprecipitated at the same time (see Figure 4).

The single-stranded DNA binding protein of
bacteriophage T4 (gp32) has an acidic carboxyterminal
region which is immunodominant (K. Krassa, Ph.D.
Thesis, 1987). In one immunization experiment,
polyclonal sera react exclusively with the
carboxyterminal domain of the protein; 12 monoclonal
cell lines derived from hybridoma fusions with spleen
cells from such immunized animals produced antibodies
that react with the same target domain. Purified 
polyclonal sera which react with the carboxy-terminal 
domain of gp32 are used for indirect 
immunoprecipitation in this example. 

A population of ribosome complexes is produced
(above). These ribosome complexes are pre-treated with
the polyclonal sera aimed at gp32; this is readily
accomplished by passing the ribosome complexes through
Staph A columns pre-bound with the polyclonal sera
against gp32. Subsequently, those ribosome complexes
unable to react directly with antibodies raised against
gp32 are reacted with gp32, followed by treatment with
the sera aimed at the carboxy-terminus of gp32. Goat
anti-mouse antibodies are used to immunoprecipitate
gp32 and whatever ribosomal complexes interact with the
core domain of gp32. Cycles of SPERT are continued
until a desired level of binding is attained. 
Sequences are then cloned and individuals identified 
and tested for affinity to gp32. 

Example 4. Isolation of a polypeptide ligand for a 
serine protease. 

Serine proteases are protein enzymes that 
catalyze hydrolysis of peptide bonds within proteins, 
often with high selectivity for specific protein 
targets (and, of course, for specific peptide bonds 
within the target protein). The serine proteases are
members of a gene family in mammals. Examples of 
serine proteases are tissue plasminogen activator, 
trypsin, elastase, chymotrypsin, thrombin, and plasmin. 
Many disease states can be treated with polypeptide 
ligands that bind to serine proteases, for example, 
disorders of blood clotting. Elastase inhibitors are 
likely to be useful in minimizing the clinical 
progression of emphysema. Proteases other than serine
proteases are also important in mammalian biology, and
these too are targets for polypeptide ligands with
appropriate affinities obtained according to the
invention herein taught.

A ligand that binds to porcine elastase is
identified and purified using the starting randomized
material of Example 3. Serine proteases are easily
attached by standard methods to column support
materials with retention of enzymatic activity.
Porcine elastase attached to agarose is available from
commercial sources. Thus, in this example affinity
chromatography is the partitioning method. Natural
elastase inhibitors are available, and are used to
check that the active site of the bound elastase is
available for the binding of an inhibitory ligand. The
buffer used for binding during the SPERT cycles must
not denature or otherwise inactivate elastase;
dithiothreitol, which can reduce protein disulfide
bonds, is left out of the binding buffer.

After several rounds of SPERT, as the affinity
of the mixture of nascent polypeptides becomes high, a
reversal of the elution parameters is used. Early
rounds of SPERT are aimed at obtaining any polypeptide
ligand that binds to any domain of elastase; after
virtually all the nascent polypeptides are able to bind
the column, the ribosome complexes are poured through a
column that has been pre-saturated with a natural
inhibitory ligand for the elastase active site. In
addition, the elution buffer for this procedure
includes high concentrations of that same natural
inhibitory ligand. The ribosome complexes that are not
bound in this reversed elution procedure are used to
prepare mRNAs for further SPERT cycles, once again
depending on high affinity for the bound elastase.
This procedure focuses the evolving polypeptide ligands
toward the elastase active site.

When the mixture of polypeptide ligands has a
high affinity for the bound elastase, and is aimed 
primarily toward the active site, further enrichment 
for high affinity inhibitors of elastase activity is 
accomplished by including low concentrations of the 
natural inhibitors in the partitioning steps, thus
demanding that the evolving polypeptide ligands have
higher affinity than the effective affinity of the
natural inhibitor at the concentration used.
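
The selective pressure exerted by a low concentration of natural inhibitor can be sketched with the standard competitive-binding relation, in which a competitor at concentration [I] with inhibition constant Ki raises the apparent dissociation constant of a ligand to Kd(1 + [I]/Ki). The Python sketch below assumes equilibrium, immobilized active sites in excess over the ribosome complexes, and inhibitor in excess over sites; all concentrations and constants are hypothetical.

```python
def apparent_kd(kd_ligand, inhibitor_conc, ki_inhibitor):
    """Apparent Kd of a ligand when a competitive inhibitor is present."""
    return kd_ligand * (1.0 + inhibitor_conc / ki_inhibitor)


def fraction_ligand_retained(site_conc, kd_ligand, inhibitor_conc, ki_inhibitor):
    """Fraction of a polypeptide ligand held on the column at equilibrium.

    ASSUMPTIONS: active sites are in excess over ligand, and the inhibitor
    is in excess over active sites, so total concentrations approximate
    free concentrations.
    """
    kd_app = apparent_kd(kd_ligand, inhibitor_conc, ki_inhibitor)
    return site_conc / (site_conc + kd_app)


if __name__ == "__main__":
    sites = 1e-8        # hypothetical active-site concentration (M)
    ki = 1e-9           # hypothetical Ki of the natural inhibitor (M)
    inhibitor = 1e-7    # hypothetical inhibitor concentration (M)
    for kd in (1e-7, 1e-9, 1e-11):
        print("Kd %.0e M: retained %.3f without inhibitor, %.3f with inhibitor"
              % (kd,
                 fraction_ligand_retained(sites, kd, 0.0, ki),
                 fraction_ligand_retained(sites, kd, inhibitor, ki)))
```

Under these assumptions the inhibitor strips weakly binding ligands from the active site while the tightest ligands remain largely bound, which is the enrichment the partitioning step is intended to achieve.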

Nucleic acids encoding polypeptide ligands are
cloned and sequenced, and binding affinities and
inhibitory binding affinities for elastase are
measured. Binding affinities and inhibitory
efficiencies are measured with the same polypeptide
ligands for other members of the serine protease family
in order to ascertain specificity within the family.

Example 5. Polypeptide ligands that antagonize a
receptor: A synthetic inhibitor of the
interleukin-1 receptor.

Receptors are a class of proteins that are 
partially integrated into the cell's cytoplasmic 
membrane such that a domain resides outside the cell. 
That domain serves as a binding site for cell-extrinsic
molecules, including growth factors, peptide hormones,
non-peptide organic molecules (which may include
hormones), or even ions. Receptors handle the bound
ligand in several different ways, including signal
transduction through the membrane or internalization of
the bound ligand for its subsequent function. In
either case polypeptide ligands of the invention may be
used to affect function of the receptor, that is, to
cause the normal activity of the natural ligand or to
block that activity.

Receptor antagonism for a useful therapeutic
purpose is accomplished by generating a polypeptide
ligand through SPERT that is aimed at the interleukin-1
receptor. A natural antagonist of the receptor
has been found (Hannum et al., Nature, 343:336-340
(1990); Eisenberg et al., Nature, 343:341-346 (1990)),
and that antagonist has the presumptive utility of
preventing or easing inflammatory problems such as
those found in rheumatoid arthritis. The natural
antagonist (called IL-1ra for IL-1 receptor antagonist)
is partially homologous to IL-1 itself, and is a
competitive inhibitor of interleukin-1 binding to the
receptor. The natural IL-1ra is a pure antagonist,
completely without agonist activity at the highest
concentrations used in the work cited above. IL-1ra is
synthesized as a protein with 177 amino acids; after
post-translational cleavage the active inhibitor has
152 amino acids and, additionally, is glycosylated.
However, the activity of recombinant IL-1ra, without
glycosylation, is comparable to the activity of the
natural inhibitor.

SPERT is used to develop a polypeptide ligand
antagonist for the interleukin-1 receptor. Two methods
are used. In the first, monoclonal antibodies are
raised against interleukin-1 that are able to
cross-react with IL-1ra. Such monoclonal antibodies in
principle recognize the features in common between IL-1
and IL-1ra. Those monoclonal antibodies are used, as
in Example 1, to develop polypeptide ligands that bind
to the antigen combining site; such polypeptide ligands
are candidates for a novel class of IL-1 antagonists.
Since one goal in this case is to provide antagonists
smaller than the natural IL-1ra, the randomized
polypeptide is ca. 50 amino acids, as in Example 3.

In a second methodology the extracellular 
domain of the IL-1 receptor is itself used as the 
target for polypeptide ligand development through 
SPERT. The domain is attached to an insoluble matrix.
Candidate polypeptide ligands, residing in ribosome
complexes, are partitioned on the matrix. The matrix
is eluted with high concentrations of IL-1, thus
displacing the ribosome complexes and nascent 
polypeptides with the natural ligand known to bind to 
the desired active site on the receptor. Cycles of 
SPERT are continued until high affinity polypeptide 
ligands are identified. 

Very high affinity, even covalent, antagonists
of the receptor are isolated by an elution protocol
during SPERT that denatures the ribosome complexes even
if the polypeptide ligand remains strongly bound to the
receptor. The mRNA eluted from the column under
protein denaturing conditions is used to prepare cDNA
which is amplified through PCR, after which
transcription provides mRNA for the next round of 
SPERT. 

All genes encoding polypeptide ligands are 
sequenced, and the polypeptide ligands are tested for 
IL-1 receptor antagonism. Those ligands identified by 
receptor-based affinity chromatography are tested with 
the antibodies of the first method to screen for the 
novel antagonists recognized by those antibodies that 
recognize structural or sequence homology between IL-1 
and IL-1ra. Novel, SPERT-generated polypeptide ligands
having IL-1 receptor antagonist activity are isolated
and characterized. SPERT-generated antagonists having
less than 50% amino acid homology with natural IL-1ra
are identified. In addition, SPERT-generated 
antagonists having less than 30% amino acid homology 
are identified. 
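
The 50% and 30% thresholds above can be checked once a selected sequence has been aligned with the natural sequence. The Python sketch below assumes that "amino acid homology" is read as percent identity over the aligned positions and that the alignment has already been made by some other means; both are assumptions of this illustration, and the example sequences are hypothetical.

```python
def percent_identity(aligned_a, aligned_b, gap="-"):
    """Percent of aligned columns in which both sequences carry the same
    residue. Inputs must be pre-aligned to equal length; columns that are
    gaps in both sequences are ignored."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("sequences must be pre-aligned to equal length")
    columns = [(a, b) for a, b in zip(aligned_a, aligned_b)
               if not (a == gap and b == gap)]
    if not columns:
        return 0.0
    matches = sum(1 for a, b in columns if a == b and a != gap)
    return 100.0 * matches / len(columns)


if __name__ == "__main__":
    natural = "ACDEFGHIKL"     # hypothetical natural segment
    selected = "ACDEYYYYYY"    # hypothetical SPERT-selected segment
    identity = percent_identity(natural, selected)
    print("identity: %.1f%%" % identity)
    print("below 50% threshold:", identity < 50.0)
```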

Example 6. Protein improvement by SPERT: Mutagenesis 
and selection of better natural 
insecticides. 

Bacillus thuringiensis is a gram-positive,
spore-forming bacterium which produces insecticidal
proteins. These proteins, derived from different B.
thuringiensis strains, have varying effectiveness for
killing insect larvae of different species. Although
a specific protein will kill the insect larvae of a
variety of species, the effectiveness toward the
different insect targets (measured as the level of
protein required to produce 50% mortality) can vary by
as much as 2000-fold. The mechanism of action of
these insecticide proteins is to bind a receptor on the
gut membranes of the susceptible insect larva. Such
membranes serve as a functional partitioning tool in
SPERT.

We create double-stranded DNA templates
suitable for SPERT by PCR; the appropriate DNA encodes
the N-terminal 646 amino acid portion of the
insecticidal protein from B. thuringiensis subspecies
kurstaki HD-1, which is fully active (Fischhoff et al.,
Biotechnology 5:807-813 (1987)). This protein kills the
larvae of tomato hornworm and cabbage looper very
effectively at low concentration; substantially more
protein is required to kill tobacco budworm, corn
earworm, black cutworm, European corn borer, and beet
armyworm. Gut membranes from each of these insect
larvae will be used as partitioning agents in SPERT.

The starting material in these experiments is
DNA derived from the cloned gene, as above. Two
methods are used to create protein variants. In one
method mutagenic PCR provides random mutations
throughout the 646 amino acids of the insecticide.
In the other method [...] randomized codons within the
insecticide, using about 50 amino acid replacements. In
particular, a randomized sequence is used to replace
the codons encoding the hypervariable region of the Bt.
toxin. Rounds of SPERT are continued until a desired
level of binding to gut membranes is achieved. The DNA
products are cloned and sequenced and individually
assayed for effectiveness in binding membranes and
larval killing. Effective toxins are selected by SPERT,
having a naturally-occurring sequence replaced by a
sequence that is less than 50%
homologous with the replaced sequence. In addition,
toxic, SPERT-generated variants are identified wherein 
the original, naturally-occurring sequence is replaced 
by a sequence having less than 30% sequence homology 
with the replaced sequence. 

Example 7. Anti-viral polypeptide ligands:
Inhibition of viral entry into target
cells.

Receptors are often used for viral attack on
cells. Recently Kaner et al. (Science, 248:1410-1413
(1990)) described the basic fibroblast growth factor
(FGF) receptor as the likely portal through which
Herpes Simplex Virus Type 1 (HSV) enters a cell. In
that same paper, by citation of other work, several
other viruses are said to utilize other receptors to 
gain cellular entry. Rhinovirus, the common cold 
virus, is said to enter cells through a cell adhesion 
molecule ICAM-1. HIV, the AIDS virus, enters cells 
through the CD4 glycoprotein receptor. Epstein-Barr 
virus enters T lymphocytes via the C3d complement 
receptor. Rabies virus enters nerve cells through the 
acetylcholine receptor. Reovirus enters cells through 
the beta-adrenergic receptor. Vaccinia virus enters 
cells through a functional interaction with the 
epidermal growth factor receptor. Apparently viruses 
survive in part by using absolutely crucial cell 
receptors to gain entry into susceptible hosts. That
is, host organisms cannot easily alter such important
receptors so as to become resistant to the virus 
without suffering some impairment of crucial cell and 
organism functions. 

Polypeptide ligands of the invention are 
identified that diminish viral uptake through receptors 
while still allowing critical growth factors to 
function. The basic FGF receptor is used to 
demonstrate a successful strategy. The soluble domain
[...] with high concentrations of FGF itself.
[...] that are not displaced by HSV but are
displaced by FGF contain nascent polypeptides that are
[...] polypeptides bind FGF receptor in a way
[...] with HSV. Candidate polypeptides are assayed for
their negative impact on HSV infection and their
inability to prevent FGF-mediated cell growth. The
useful polypeptide ligands in this example are
neither antagonists nor agonists of the FGF receptor
at concentrations that [...]. A polypeptide meeting the
criteria having less than 50% amino acid homology with
FGF is isolated. In addition, a polypeptide meeting
the criteria having less than 30% homology with FGF
is isolated.

Example 8. Polypeptide ligands that enter cells: The
glucocorticoid receptor and trojan horse
ligands.



The glucocorticoid receptor protein binds 
steroid hormone, after which the receptor protein is
internalized from the membrane so that the receptor can
make its way into the cell nucleus. The receptor has a 
DNA binding domain (DBD) that interacts in the nucleus 
with target DNA sequences. Polypeptide ligands of the 
invention, agonists of the glucocorticoid receptor, are 
internalized along with the receptor, and thus directed 
sequentially to the cytoplasm and then to the nucleus.
Depending on the dissociation rate constant for
specific polypeptide ligands, these ligands largely
reside after uptake in either the cytoplasm or the
nucleus.

Using the randomized starting material of 
Example 3, SPERT is directed toward the glucocorticoid 
receptor, either with indirect immunoprecipitation or 
affinity chromatography using bound receptor. As in 
the prior example, SPERT protocols are manipulated so that
polypeptides are found that compete directly for the 
glucocorticoid binding domain but that have much lower 
affinity than that observed for steroid hormones. As 
the polypeptide ligands evolve, screening of potential 
ligands is performed on individual candidates; thus 
resistance to proteolysis of the polypeptide ligand is 
tested using whole cell entry prior to the protease 
challenge, and testing both cells with and without an 
abundance of the glucocorticoid receptor. Polypeptide 
ligands that enter cells are localized in the cytoplasm 
or nucleus by means available to those skilled in the 
art. Those polypeptide ligands that enter cells with 
proper localization are fused to other polypeptide 
ligands to provide cell entry for molecules with other 
useful activities. 

Example 9. Polypeptide ligands toward nucleic acids: 
Inhibitors of transcription. 

Cancer cells can result from the over-expression
of a transcriptional activator protein that
functions to enhance transcription and subsequent
expression of sets of genes that push the cell toward
inappropriate and uncontrolled growth. Thus, mutations
that elevate the activity of a transcriptional enhancer
can cause cancer through enhancement of the expression
of a set of genes relevant for growth control. Such
tumors are treatable with polypeptide ligands that
reset the appropriate level of [...]
of the transcriptional enhancer. While it is likely
that polypeptide ligands may be aimed at the enhancer
protein directly, thus inhibiting the activity and
restoring a proper growth rate, in this example
a polypeptide ligand is aimed at the production rate of
the transcriptional enhancer.

The polypeptide ligand of interest binds to the 
genome of the cancer cell at a location that competes
for transcription of the gene that encodes the
transcriptional activator protein, and hence expression
of that protein. That is, in classical genetic
language, the polypeptide ligand is a specific 
transcriptional repressor. 

The starting materials of Example 3 are used to 
generate a mixed pool of candidate polypeptides. A 
specific sequence of double-stranded DNA is prepared by 
chemical means and covalently attached to an insoluble
column matrix. The ribosome complexes
containing nascent polypeptide ligands that interact
with double-stranded DNA (either with sequence
specificity or not) are retarded on the column,
recovered, and placed into the SPERT protocol of mRNA
amplification, transcription, and a second cycle. In
order to eliminate polypeptide ligands with affinity
for all double-stranded DNA (that is, without adequate
sequence specificity for the intended use), the
ribosome complexes are mixed with random soluble
double-stranded DNA sequences prior to the column
partitioning step. The soluble DNA concentration is 
adjusted to give about tenfold more non-specific DNA 
during the partitioning step than is the abundance of 
specific DNA sequences attached to the column. In this 
manner polypeptide ligands that are indifferent to DNA 
sequence emerge from the column along with ribosome 
complexes containing polypeptide ligands that are 
unable to bind DNA at all. 
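
The tenfold competitor excess described above can be put into numbers with a short calculation. The sketch below is illustrative only: the function names and the 50 pmol column load are hypothetical, and the retained-share estimate assumes an idealized sequence-indifferent binder that binds every DNA site equally well with no saturation.

```python
def soluble_competitor_amount(column_dna_pmol: float, fold_excess: float = 10.0) -> float:
    """Amount of soluble non-specific DNA (pmol) to mix with the ribosome complexes.

    column_dna_pmol: specific DNA attached to the column (hypothetical input).
    fold_excess: excess of non-specific over specific DNA ("about tenfold" in the text).
    """
    if column_dna_pmol < 0 or fold_excess < 0:
        raise ValueError("amounts and fold excess must be non-negative")
    return column_dna_pmol * fold_excess


def column_share_for_indifferent_binder(fold_excess: float = 10.0) -> float:
    """Idealized share of a sequence-indifferent binder that ends up on the column.

    Assumption (not from the source): the binder partitions between column DNA
    and soluble DNA in proportion to the amount of each.
    """
    if fold_excess < 0:
        raise ValueError("fold excess must be non-negative")
    return 1.0 / (1.0 + fold_excess)


if __name__ == "__main__":
    load = 50.0  # hypothetical pmol of specific DNA on the column
    print(soluble_competitor_amount(load))
    print(round(column_share_for_indifferent_binder(), 3))
```

Under these assumptions, a tenfold excess leaves roughly one eleventh of the sequence-indifferent complexes on the column in each round, while a sequence-specific binder is largely unaffected by the competitor.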

Polypeptide ligands aimed at a specific DNA 
sequence are characterized further. Randomized DNA 
sequences are used to establish which nucleotide pairs 
in the covalently attached DNA are required for avid 
binding of the polypeptide (using the SELEX protocol 
described in U.S. Patent Serial No. 07/536,428). A 
second SPERT is directed toward the contiguous DNA base 
pairs that are not bound by the first isolated 
polypeptide ligand, and the genes for the first and 
second polypeptide ligands are combined to yield a 
polypeptide ligand fusion (in either order, and 
containing a flexible peptide linker) to provide a 
polypeptide ligand with higher specificity and avidity 
than is available from either polypeptide ligand by 
itself. This improvement in specificity and avidity is 
an example of walking, although in this case the 
"steps" are made independently and the polypeptide 
ligands joined post-identification. 

The sequence of double-stranded DNA chosen in 
this example must overlap a transcriptional initiation 
signal. The ras oncogene transcriptional initiation 
region is chosen first. 

Example 10. Human c-myc protein epitope. 

This experiment shows that it is feasible to 
select an epitope or epitopes from a random mixture of 
RNA-encoded peptides. An antibody was chosen which 
recognizes an epitope in human c-myc protein [...] 
of the amino acid sequence Glu-Gln-Lys-Iso-Ser-Glu-Glu 
[...] for either eukaryotic or prokaryotic 
[...] sites for random or 
non-random sequences which would encode nascent 
peptides accessible to selection on ribosomes, and a 
fixed translated sequence (3'-FTR) which encodes 
peptide sequences which are buried in the 
ribosome. Refer to Table 3. The T7 promoter sequence 
was added to the eukaryotic 5'-UTR through PCR with 
oligos 1 and 2 from Table 3 using plasmid pSPBP4, which 
is described by Siegel and Walter ([...] 
1988). The 3'-FTR was obtained by PCR of the same 
plasmid using oligos [...] and 10 from Table 3. These two 
fragments, the 5'-UTR and 3'-FTR, were cut with NheI and 
ligated. The ligated fragment was purified and further 
[...] prior to cloning into the HindIII and BamHI sites 
of pBSSK+ (purchased from Stratagene Systems Inc.) to 
create the plasmid pPSX-EUK. The prokaryotic 5'-UTR 
will be cloned using oligos 3 and 4 from Table 3 into 
the HindIII and NheI sites of pPSX-EUK to create 
pPSX-PROK, replacing the eukaryotic ribosome binding site 
with a prokaryotic one. The myc epitope-encoding insert 
is obtained by PCRing the template oligo 7 with 
oligos 5 and 6, all from Table 3, and the variable 
insert (for eight amino acids) is obtained by PCRing 
the template oligo 8 with oligos 5 and 6, from 
Table 3. These inserts will be digested with NheI and 
[...] and ligated in the presence of [...] 
pPSX-EUK and pPSX-PROK. (This was done for the myc 
insert in pPSX-EUK.) Thus there will be a positive 
control myc 
epitope-encoding expression system which can be 
translated by eukaryotic translation systems and 
separately by prokaryotic translation systems, a 
variable nascent peptide-encoding system which can be 
likewise variably translated, and a system with no 
inserts which can serve as an internal control for 
comparing extents of enrichments by selection of 
polysomes by the anti-myc antibody. Further testing 
will identify what 3' ends will give the stablest 
polysome complexes; this may be accomplished by using 
oligo 10 in PCR (with oligo 1) to create multiple 
histidine codons for translation with no added 
histidine, with oligo 11 for normal unstopped 
translation with no amino acid depletion, and to test 
the extent of translation using oligo 12 which puts two 
stop codons allowing repeated translation of individual 
mRNAs. 
1. A method for making a polypeptide ligand of a 
target molecule comprising: 

a) synthesizing a translatable mRNA mixture 
comprising a ribosome binding site, 
translation initiation codon and a 
randomized sequence coding region; 

b) synthesizing a mixture of ribosome 
complexes, each member thereof comprising 
a ribosome, a nascent polypeptide and a 
translated mRNA, said mRNA having a 
randomized coding region and said nascent 
polypeptide being the translation product 
of said mRNA; 

c) partitioning the ribosome complexes with 
respect to binding of the ribosome 
complexes to a desired target molecule, 
thereby separating the ribosome complexes 
into ribosome complex-target pairs and 
unbound complexes, the ribosome complex-target 
pairs having mRNA enriched for 
sequences encoding target-binding 
polypeptides; 

d) amplifying the mRNA of partitioned 
ribosome complex-target pairs to yield a 
translatable mRNA mixture comprising a 
ribosome binding site, an initiation codon 
and a coding region enriched for sequences 
encoding target-binding polypeptides; 

e) repeating steps b) through d) using the 
mRNA enriched for sequences encoding 
target-binding polypeptides of each 
successive repeat as many times as desired 
to yield a desired level of target binding 
by a polypeptide encoded by the mRNA 
enriched for sequences encoding the 
polypeptide; and 

f) synthesizing a polypeptide encoded by the 
enriched mRNA of step e), thereby making a 
polypeptide ligand of a target molecule. 
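
The repeated partition-and-amplify cycle of steps b) through e) can be illustrated with a toy enrichment model. This is a sketch under stated assumptions, not part of the claimed method: the function name `enrichment_by_round`, the recovery parameters, and the example values are hypothetical, each sequence is treated as either a binder or a non-binder, and amplification is assumed to preserve the ratio between the two.

```python
def enrichment_by_round(initial_fraction: float,
                        binder_recovery: float,
                        background_recovery: float,
                        rounds: int) -> list[float]:
    """Fraction of target-binding sequences before and after each selection round.

    initial_fraction: starting share of binders in the mRNA mixture (0 < f < 1).
    binder_recovery: probability that a binder complex is kept when partitioning.
    background_recovery: probability that a non-binder is carried over anyway.
    All three parameters are illustrative assumptions.
    """
    if not 0.0 < initial_fraction < 1.0:
        raise ValueError("initial_fraction must lie strictly between 0 and 1")
    if not 0.0 < binder_recovery <= 1.0:
        raise ValueError("binder_recovery must be in (0, 1]")
    if not 0.0 <= background_recovery <= 1.0:
        raise ValueError("background_recovery must be in [0, 1]")
    if rounds < 0:
        raise ValueError("rounds must be non-negative")

    fraction = initial_fraction
    history = [fraction]
    for _ in range(rounds):
        kept_binders = fraction * binder_recovery                  # partitioning, step c)
        kept_background = (1.0 - fraction) * background_recovery
        fraction = kept_binders / (kept_binders + kept_background)  # amplification, step d)
        history.append(fraction)
    return history


if __name__ == "__main__":
    # One binder per million sequences; binders recovered 100x better than background.
    for n, f in enumerate(enrichment_by_round(1e-6, 0.5, 0.005, 4)):
        print(n, f)
```

In this model the odds of a sequence being a binder are multiplied by the ratio of the two recoveries in every round, which is why a small number of cycles can suffice even when binders are initially rare.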

2. The method for selecting a polypeptide ligand 
of a desired target molecule from a polypeptide 
mixture comprising: 

a) synthesizing a polypeptide mixture each 
member thereof having attached thereto 
amplifying means for separately amplifying 
the individual polypeptide to which it is 
attached; 

b) partitioning the polypeptide mixture with 
respect to binding the target molecule, 
thereby separating the mixture into 
polypeptide-target pairs and unbound 
polypeptides; 

c) amplifying the polypeptides of 
polypeptide-target pairs using said 
amplifying means; and 

d) repeating the partitioning and amplifying 
steps to select a polypeptide ligand of a 
desired target molecule. 

3. The method of claim 2 wherein the polypeptide 
mixture comprises polypeptides having a segment 
of randomized amino acid sequence. 

4. The method of claim 3 wherein the segment of 
randomized amino acid sequence is from 4 to 50 
amino acids in length. 
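
The 4 to 50 residue range spans an enormous difference in sequence space, which a short calculation makes concrete. The sketch below is illustrative: the function names and the pool size of 10^13 molecules are hypothetical, and it counts peptide sequences over the 20 standard amino acids, ignoring stop codons and codon degeneracy in the randomized mRNA.

```python
AMINO_ACID_ALPHABET = 20  # standard amino acids


def peptide_sequence_space(length: int) -> int:
    """Number of distinct peptides with a randomized segment of the given length."""
    if length < 1:
        raise ValueError("length must be at least 1")
    return AMINO_ACID_ALPHABET ** length


def longest_fully_sampled_segment(pool_size: int) -> int:
    """Longest segment whose whole sequence space fits into a pool of pool_size molecules."""
    if pool_size < AMINO_ACID_ALPHABET:
        return 0
    length = 0
    while AMINO_ACID_ALPHABET ** (length + 1) <= pool_size:
        length += 1
    return length


if __name__ == "__main__":
    print(peptide_sequence_space(4))    # shortest segment in the claimed range
    print(peptide_sequence_space(50))   # longest segment in the claimed range
    print(longest_fully_sampled_segment(10 ** 13))  # hypothetical pool size
```

For longer segments a physical pool can hold only a sparse sample of the possible sequences, which is one reason iterated selection with amplification, rather than exhaustive screening, is the practical route.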

5. The method of claim 3 wherein the amplifying 
means comprises an mRNA mixture, each member 
thereof encoding a polypeptide of the 
polypeptide mixture and being attached to the 
polypeptide it encodes as part of a ribosome 
complex. 

6. The method of claim 3 wherein the step of 
amplifying the polypeptides comprises the 
additional step of amplifying the mRNA mixture. 

7. The method of claim 6 wherein the mRNA mixture 
is amplified by reverse transcription and a 
polymerase chain reaction. 

8. A method for making a polypeptide ligand of a 
target molecule comprising: 

(a) synthesizing a mRNA mixture comprising 
translatable and nontranslatable regions, 
wherein said translatable region comprises 
randomized and fixed sequence coding 
regions; 

(b) synthesizing a mixture of mRNA•polypeptide 
copolymers, each member comprising an mRNA 
and a polypeptide encoded by its 
associated mRNA, wherein a portion of said 
nontranslatable region of said mRNA and a 
portion of said polypeptide encoded by 
said fixed sequence coding region form a 
binding interaction; 

(c) partitioning the mRNA•polypeptide 
copolymers with respect to affinity of the 
copolymers to a desired target molecule; 

(d) amplifying the mRNA of partitioned 
copolymers to yield a translatable mRNA 
mixture; and 

(e) synthesizing a polypeptide or polypeptides 
encoded by the mRNA mixture of step (d). 

9. The method of claim 8 further comprising the 
steps of repeating steps (a) through (d) using 
the mRNA mixture of step (d) in successive 
cycles repeating as many times as desired to 
yield copolymers with the desired affinity to 
the target. 

WO 93/03172 
PCT/US92/00801 

10. The method of claim 8 wherein the target 
molecule is a protein. 

11. The method of claim 10 wherein the protein is 
an enzyme. 

12. The method of claim 10 wherein the protein is 
an antibody. 

13. The method of claim 10 wherein the protein is a 
receptor. 

14. The method of claim 10 wherein the protein is a 
nucleic acid binding protein. 

15. The method of claim 10 wherein the protein is a 
toxin. 

16. The method of claim 10 wherein the protein is a 
glycoprotein. 

17. The method of claim 10 wherein the protein is 
an antigen. 

18. The method of claim 8 wherein the polypeptide 
is an inhibitor of function of the target 
molecule. 

19. The method of claim 8 wherein the target 
molecule is a cell membrane component. 

20. The method of claim 8 wherein the target 
molecule is a virus component. 

21. The method of claim 8 wherein the target 
molecule is a carbohydrate. 

22. The method of claim 8 wherein the target 
molecule is a polysaccharide. 

23. The method of claim 8 wherein the target 
molecule is a lipid. 

24. The method of claim 8 wherein the target 
molecule is a glycolipid. 

25. The method of claim 8 wherein the target 
molecule is a toxin. 

26. The method of claim 8 wherein the target 
molecule is a drug. 

27. The method of claim 8 wherein the target 
molecule is a controlled substance. 

28. The method of claim 8 wherein the target 
molecule is a metabolite. 

29. The method of claim 8 wherein the target 
molecule is a cofactor. 

30. The method of claim 8 wherein the target 
molecule is a nucleic acid. 

31. The method of claim 8 wherein the target 
molecule is a hormone. 

32. The method of claim 8 wherein the target 
molecule is a receptor ligand. 

33. The method of claim 8 wherein the target 
molecule is a transition state analog. 

34. The method of claim 8 wherein the partitioning 
is carried out by column chromatography. 

35. The method of claim 8 wherein the partitioning 
is carried out by binding to target molecules 
attached to a solid phase matrix. 

36. The method of claim 8 wherein the partitioning 
is carried out by immunoprecipitation. 

37. The method of claim 8 wherein the partitioning 
is carried out by indirect immunoprecipitation. 

38. The method of claim 8 wherein the mRNA is 
amplified in step d) by polymerase chain 
reaction. 

39. The method of claim 8 wherein the process of 
amplifying in step d) includes introducing 
mutations during amplification. 

40. The method of claim 8 wherein step f) is 
carried out by chemical synthesis of the 
polypeptide ligand. 

41. The method of claim 8 wherein the mRNA 
additionally comprises a sequence encoding a 
segment of polypeptide that functions to bind a 
bridging molecule and step c) further comprises 
binding target molecules to a solid phase 
matrix and binding to the target molecules an 
anchor molecule covalently bound to the 
bridging molecule, the anchor molecule being 
capable of specifically binding the target 
molecules whereby mRNA•polypeptide copolymers 
bind to the bridging molecule anchored to the 
target molecules. 

42. The method of claim 8 comprising the additional 
steps of synthesizing a second translatable 
mRNA mixture comprising the mRNA selected by 
steps a) - e) and a second randomized sequence 
coding region, and repeating steps b) - e) 
using the second translatable mRNA mixture to 
yield a desired level of target binding by a 
polypeptide encoded by the second mRNA enriched 
for sequences encoding the polypeptide. 

43. A mixture of mRNA•polypeptide copolymers 
comprising: 

an mRNA comprising untranslatable 
portions and translatable portions; 

a polypeptide encoded by said mRNA 
comprising random and fixed sequence regions, 
wherein said mRNA and polypeptide are bound 
together by at least a portion of the 
untranslatable portion of said mRNA and at 
least a portion of the fixed sequence region of 
said polypeptide. 

44. A polypeptide that is a ligand of a target 
molecule prepared according to the method 
described in claim 8. 

45. A method for making a polypeptide ligand of a 
target molecule comprising: 

(a) synthesizing a mRNA mixture of at least 
10^1[...] sequences comprising translatable and 
nontranslatable regions; 

(b) synthesizing a mixture of mRNA•polypeptide 
copolymers, each member comprising an mRNA 
and a polypeptide encoded by its 
associated mRNA, and not containing a 
ribosome; 

(c) partitioning the mRNA•polypeptide 
copolymers with respect to affinity of the 
copolymers to a desired target molecule; 

(d) amplifying the mRNA of partitioned 
copolymers to yield a translatable mRNA 
mixture; and 

(e) synthesizing a polypeptide or polypeptides 
encoded by the mRNA mixture of step (d). 



46. The method of claim 45 wherein said 
mRNA•polypeptide copolymers are synthesized by 
the post-translational or co-translational 
interaction between a portion of the 
nontranslatable portion of said mRNA and a 
portion of said polypeptide. 

47. The method of claim 45 wherein said 
mRNA•polypeptide copolymers are synthesized by 
crosslinking the polypeptide-tRNA-mRNA complex 
after translation of the mRNA. 

48. The method of claim 45 wherein said 
mRNA•polypeptide copolymers are synthesized by 
linking the 5' nucleic acid sequence of the 
mRNA to the initial amino acid sequences of the 
polypeptide prior to translation. 
TABLE 2 

1.) 5' fixed sequence 

    HindIII site, T7 promoter, ribosome binding site, EcoRI site 

2.) Stratagene polylinker cloning site (pBSSK+) 

    5'-TCGATAAGCTTGATATCGAATTCCTGCAGCCCGGGGGATCCACTAG-3' 
    (sites marked: HindIII, EcoRI, PstI, BamHI) 

3.) 3' primer annealing site and insertion sequence cloning sites 

    EcoRI, PstI, NcoI 

4.) Randomizing oligonucleotides to be cloned at the EcoRI, PstI, and NcoI sites. 

    R.I  5'-CCCGAATTC-[-45N-]-CTGCAGTGCTGCCATGGT-3' 
                             3'-GTCACGACGGTACCA-5' 

    [...]cAACGCTGCCTCTCCTCCTAGGCCGCC-[...] 
TABLE 3 

5'UTR 

1. [...] 
   5'-GGGAAGCTTAATACGACTCACTATAGGGAGCTTGTTCTETTTTCCAGAAGCTCAG-3' 

2. [...] (primer for PCRing the 5' untranslated region prior to ligation) 
   5'-CTCGGCGCTAGCCATGGTGATCTGCCAAAGTTGAG-3' 

3. PROTOP (5' primer for fixed proke UTR-RBS PCR and cloning) 
   5'-CCGAAGCTTAATACGACTCACTATAGGGTAAGATAAGATAAGGAGGAAAATAAAATGG-3' 

4. PROBOT (Complement to PROTOP for cloning proke UTR-RBS) 
   5'-CTAGCCATTTTATTTTCCTCCTTATCTTATCTTACCCTATAGTGAGTCGTATTAAGCTTCGG-3' 

Insert 

5. 5'insertPrimer (for amplifying insert) 
   5'-GGGCCATGGCTAGCGCCGAGGA-3' 

6. PM3 (3' primer for fixed epitope (EPI) and variable region (VAR) PCR, 
   sequencing and (maybe) cloning) 
   5'-GGCGGATCCAGGCGGGACCCTT TCTGCGACGAA-3' 

7. MycCODE (oligo for EPI construction) 
   5'-CGGCCATGGCTAGCGCCGAGGAGCAGAAGCTGATCTCCGAGGAGGACCTGCTGGAATTCGTCGCAGAAAGGGTCCCC-3' 

8. VarCODE (oligo for VAR construction) 
   5'-GGGCCATGGCTAGCGCCGAGGAGNNNNNNNNNNNNNNNNNNNNNNNNC[...] 
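
As a cross-check on the MycCODE entry, the legible 5' portion of the oligo can be translated in the reading frame set by its ATG. The sketch below is illustrative and not part of the original tables: the helper `translate` and the constant names are hypothetical, the standard genetic code is assumed, and only the stretch of sequence that is unambiguous in the source is used.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G (third base fastest).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(codon): aa
               for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)}

# Legible 5' portion of oligo 7 (MycCODE) from Table 3.
MYC_CODE_FRAGMENT = "CGGCCATGGCTAGCGCCGAGGAGCAGAAGCTGATCTCCGAGGAG"


def translate(dna: str) -> str:
    """Translate from the first ATG to the first stop codon or the end of the sequence."""
    dna = dna.upper()
    start = dna.find("ATG")
    if start < 0:
        raise ValueError("no ATG start codon found")
    peptide = []
    for i in range(start, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon
            break
        peptide.append(amino_acid)
    return "".join(peptide)


if __name__ == "__main__":
    print(translate(MYC_CODE_FRAGMENT))
```

The translated fragment contains the run Glu-Gln-Lys-Leu-Ile-Ser-Glu-Glu (EQKLISEE in one-letter code), consistent with the c-myc epitope used as the positive control in Example 10.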
[Diagram accompanying Table 2 (panels 2-1, 2-3 and 2-4): schematic of the expression construct. Labels: HindIII site; T7 promoter; ribosome binding site; EcoRI site; variable sequence insertion site; PstI site; C-terminus trailer insertion site; NcoI site; BamHI site; 3' primer annealing site.] 
Integrin αvβ3 binding protocol. 

Plates: Nunc Immunomodule U8 Maxisorp. Biotecline cat nun-475078. 

REAGENTS 

Coating                 Integrin receptor in PBS 
Blocking buffer         TBS, 1% BSA (Sigma A-7030) 
Wash buffer             TBS, 0.1% Tween 20 (Sigma P-9416), 1 mM MnCl2 
Ligand binding buffer   TBS, 0.01% Tween 20, 0.1% BSA, 1 mM MnCl2 
HRP-streptavidin        1:10000 dilution in Ligand binding buffer 
HRP-substrate           TMB PLUS substrate (KemEn-Tech. cat 4390 - 500 ml) 
Stop                    0.2 M sulphuric acid 

Coating 

Coating with 1-3 µg/mL integrin receptor (0.1-0.3 µg/well) overnight at room temp should be sufficient 
for detection of ligand binding. 
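
The two coating figures can be cross-checked with a short calculation. The sketch below assumes microgram units throughout and the 100 µl coating volume given in step 1 of the protocol; the function name `coating_mass_per_well_ug` is hypothetical.

```python
def coating_mass_per_well_ug(concentration_ug_per_ml: float, volume_ul: float = 100.0) -> float:
    """Mass of integrin receptor (µg) delivered to one well.

    concentration_ug_per_ml: coating concentration in µg/mL.
    volume_ul: coating volume per well in µl (100 µl assumed from protocol step 1).
    """
    if concentration_ug_per_ml < 0 or volume_ul < 0:
        raise ValueError("concentration and volume must be non-negative")
    return concentration_ug_per_ml * volume_ul / 1000.0  # 1000 µl per mL


if __name__ == "__main__":
    for conc in (1.0, 3.0):
        print(conc, "µg/mL ->", coating_mass_per_well_ug(conc), "µg/well")
```

With these assumptions the 1-3 µg/mL range corresponds to the stated 0.1-0.3 µg per well.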

Blocking 

1 hour at room temp. 

Washing 

Washing is done after blocking, ligand binding and HRP-streptavidin binding. 

Ligand binding 

Ligand binding at rt. for 90 min. MnCl2 is essential for ligand binding. 
PROTOCOL: 

1. Coat 96 well plates with 100 µl diluted integrin receptor o.n. at 4°C. 
2. Block for 1 hour with 250 µl blocking buffer. 
3. Wash with 2 x 250 µl wash buffer. 
4. Bind 4-10 pmol ligand in 100 µl ligand binding buffer at rt. for 90 min. 
5. Wash with 7 x 250 µl wash buffer. 
6. Incubate with 100 µl HRP-streptavidin 1:10000 dilution in wash buffer, 1 hour at rt. 
7. Wash with 7 x 250 µl wash buffer. 
8. Add 100 µl TMB substrate. Incubate at rt. until color development. 
9. Add 100 µl stop. 
10. Read at 450 nm in microplate reader. 



NB! Mn2+ is essential in all buffers to maintain ligand binding. 



If working with photo cleavable spacers all incubations should be in the dark. 